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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 




(D) TOPOLOGY: linear 
Hi) MOLECULE TYPE: DNA i genomic) 
(xi) SEQUENCE DESCRIPTION: SEC! ID NO : 1 : 
3AGAGATGGG GGAGOCTAAC TGAG 



24 



(2) INFORMATION FOR SSQ ID NO : 2 : 

:i) SEQUENCE CHARACTER I ST I 2S : 
■A) LENGTH: 2 8 base pairs 
iB) TYPE: nucleic acid 
(0) STRANDEDNESS : single 
f D ) TOPOLOGY : 1 inear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:2: 
GATCCTCAGT TAGCOT TCCC CATCTCTC 2 8 

(2) INFORMATION FOR SSQ ID NO : 2 : 

i i ) SE I'UENCE CHARACTER : STI 2S : 

(A) LENGTH: 5 5 base pairs 

(B) TYPE: nucleic nzil 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 

Hi) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
T7VTATCTCCA GATGAGGTAC ATGATTTTAG GCTTG 3 5 

(2) INFORMATION FOR SEQ ID NO : 4 : 

{ i ) SEQUENCE CHARACTER I STI CS : 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 4 : 
TATATATCGA TTCAAGGCAT TTTCTTTTCA TCAATAAAAC 4 0 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 
TCGAGGATCC GCCCGGGCGG CCG CATC GAT GTCGACG 3 7 

(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGCGTCGACA TCGATGCGGC CGCCCGGGCG GATCC 3 5 



(2) INFORMATION FOR SEQ ID MO: 7: 

( i ) sequen :e : i haracter:sti cs •. 

(A) LENGTH: 77 base pairs 

(B) TYPE: nu:*leic acid 
(0) S TR AN D E D N S S S : single 
i"D) TOPOLOGY: linear 

(ii) MjLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCR I PT ION' : SEQ ID NO : 7 : 
AGTGAATTOG AGCTCGGTAC OCGGGGATCC TCTAGAGTCG ACCTGCAGGC ATGCAAGCTT 6 0 

GGCGTACTOA TGGTCAT 77 

(2) information for seq id no:8: 

( i ) SEQUENCE CHARACTER I ST I :s : 

( A ) L ENGTH : ^ am i n j a : i as 

(B) TYPE: amine acid 

(0) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: pro:ein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 
Ala Arg Glu Mer Gly Glu Al 3 Asn 
1 5 

(2) INFORMATION FOR SEO ID NO : 9 : 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNSSS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 
CCCGAGAGAT GGGGGAGGCT AACTGAG 2 7 

(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 3 1 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNSSS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 0 : 
GGGCTCTCTA CCCCCTCCGA TTGACACCTA G 31 

(2) INFORMATION FOR SEQ ID NO : 1 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 1 : 
Thr He Met Thr Met 
1 5 



(2) INFORMATION FOR SEQ ID NO: 12: 
(i) SEQUENCE CHARA2TERISTI 2S : 
(A) LENGTH: 24 base pairs 
■'.£) TYPE: nucLei:: acii 
■C) 3TRANDEDNE5S: single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID ND:12: 
CCCT3TGCCT TATTTGAACT AACC 24 

(2) INFORMATION FOR SEQ ID NO : 1 3 : 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
1 3) TYE'E: nucleic acii 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION': SEj ID NO: 13: 
COCACCACAA CCAOATATCO CTCC 2 4 

(2) INFORMATION FOR SEQ ID NO: 14: 
( i ) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 19 base pairs 
(p.) TYPE: nucleic acid 

(C) STRANDSDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
CCAGTCCTCC GATTGACTG 19 

(2) INFORMATION FOR SEQ ID NO: 15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8332 base pairs 
(3) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GOGCCAGTCC TCCGATTGAC TGAGTCGCCC GGGTACCCGT GTATCCAATA AACCCTCTTG 6 0 

CAGTTGCATC CGACTTGTGG TCTCGCTGTT CCTTGGGAGG GTCTCCTCTG AGTGATTGAC 12 0 

TACCCGTCAG CGGGGGTCTT TCATTTGGGG GCTCGTCCGG GATCGGGAGA CCCCTGCCCA 18 0 

GGGACCACCG ACCCACCACC GGGAGGTAAG CTGGCCAGCA ACTTATCTGT GTCTGTCCGA 24 0 

TTGTCTAGTG TCTATGACTG ATTTTATGCG CCTGCGTCGG TACTAGTTAG CTAACTAGCT 3 00 

CTGTATCTGG i:GGACCi:GTG GTGGAACTGA CGAGTTCGGA ACACCCGGCC GCAACCCTGG 36 0 

GAGACGTCCC AGGGACTTCG GGGGCCGTTT TTGTGGCCCG ACCTGAGTCC AAAAATCCCG 42 0 




ATCGTTTTG 3 AGTCTTT3GT GCACCCCCCT TAG A "J GAG '3 '3 ATATGTGGTT 3T3GTAGGAG 480 
A03A3AA2GT AAAAOAGTTG GGGCGTGGGT CT3AATTTTT GCTTTCGGTT TG 3GAC 3 3AA 54 J 

GGCGG 3CCGG GGGTGTTGTC TGGTGCAGGA TCGTTGTGTG TTGTGTGTGT GTGAGTGTGT 6 00 

TT3TGTATTT GTGTGAGAAT ATG3GGGA3A GT3TTACGA3 TGGCTTAAGT TTGA 3CTTA 3 >><>0 
GT3ACTGGAA AGATGTGGAG CGGATGGOTC AGAAGGA 3TC GGTAGATGTC AA3AAGAGAG 72 0 

GTTGGGTTAC CTTCTGCTCT GGAGAATGGG CAACCTTTAA GGTGGGATGG CCGCGAGAGG 78 0 

GCAC 3TTTAA G:GAGAGGT 3 ATGAGG3AGG TTAAGATGAA GGT 3TTTTGA CCTGGGGGjZ 8 4 0 

ATGGAGAGGG AGAGGAGGTG GGGTAGATGG TGAGGTGGGA AGGGTTGGGT TTTGACC3GC 0 00 

OTGGGTGGGT CAAGCCCTTT GTACACCCTA AGCCTCCGCC TCGTGTTGOT CCATGGGGGG \h-\0 

GGTGTGTGGG CCTTGAACCT 3GTGGTTG3A CGGGGCGTGG AT G3TGGCTT TATCGAGGGC 103 0 

TCACTCCTTC TGTAGGGGGC AAACCTAAAC CTCAAGTTCT TTGTGA3AGT GGGGGGGGGC IOmO 

TCATCGACCT ACTTACAGAA GACCCCCCGC GTTATAGGGA CCGAAGACCA CCCCCTTCCG 114 0 

AGAGGGACGG AAATGGTGGA GAAGGGAGOC GTGGGGGAGA GGCACCGGAC CGCTCCCCAA 12 00 

TGGCATCTCG CCTACGTGGG AGACGGGAGC CCCCTGTGGC CGACTCCACT ACCTCGCAGG 12*' Q 

CATTCCCCCT CCGCGCAGGA GGAAACGGAC AGGTTGAATA CTGGCCGTTC TCCTCTTCTG 13.':0 

ACCTTTACAA CTGGAAAAAT AATAACCCTT CTTTTTCTGA AGATCCAGGT AAACTGACAG 13*0 

CTCTGATCGA GTCTGTTCTC ATCACCCATC AGCCCACCTG GGACGACTGT CAGCAGCTGT 14 4 0 

TGG(3GACTCT GCTGACCGGA GAAGAAAAAC AACGGGTGCT GTTAGAGGCT AGAAAGGCGG 1500 

TGGGGGGCGA TGATGGGCGC CCCACTCAAC TGCCCAATGA AGTGGATGCC GCTTTTCCCC 15t0 

TCGAGCGCCC AGAGTGGGAT TACACCACCC A(3GCAGGTAG GAACCACCTA GTCCACTATC 162(1 

GCCAGTTGCT CCTAGCGGGT CTCCAAAACG CGGGCAGAAG CCCCACCAAT TTGGCCAAGG 1680 

TAAAAGGAAT AACACAAGGG CCCAATGAGT CTCCCTCGGC CTTCCTAGAG AGACTTAAGG 174 0 

AAGCCTATCG CAGGTACACT CCTTATGACC CTGAGGACCC AGGGCAAGAA ACTAATGTGT 1800 

CTATGTGTTT CATTTGGCAG TCTGCCCCAG AGATTGGGAG AAAGTTAGAG AGGTTAGAAG 1 8 6 0 

ATTTAAAAAA CAAGACGCTT GGA(3ATTTGG TTAGAGAGGC A(3AAAA(3ATG TTTAATAAAG 192 0 

GAGAAACCCC GGAAGAAAGA GAGGAACGTA TCAGGAGAGA AACAGAGGAA AAAGAAGAAG 198 0 

GCCGTAGGAC AGAGGATGAG CAGAAAGAGA AAGAAAGAGA TCGTAGGAGA CATAGA(3AGA 2 04 0 




TjAjCAAGCT ATTGGCCACT GTCGTTAGTG 
GAA3GAG3T2 C 2AAGTGGAT C G : 3AC 2AGT 
GTAAAGATTG TCCCAAjAAA GGAGGAGGAG 
TGAGGGTAGA TGACTAGGGA GGTGAGGGTG 
TGAAAGTGGG GGGGGAAGGG GTOACCTTCC 
TGACCCAAAA TGGTGGA:GG G TAAGT jATA 
jAAAGGGGTA TGGGTGGAGG AGG 3ATGGCA 
AGTGTTTGGT GGATGTAGGA GACTGTCCCT 
AAGTAAAAGG GGAAATGGAG TTTGAGGGAT 
AG AGTGTTGAGG GTAAATATAG 

AAGAGGGA3A TGTTTGTGTA GGGTGGAGAT 
AAACCGGGGG CATGGGACTG GCAGTTCGCC 
CCTCTACCCC CGTGTCCATA AAAGAATAGG 
AGGGGGAGAT ACAGAGA :TG TTGGACCAGG 
AGAGGGGGGT GCTACCCGTT AAGAAACCAG 
TGAGAGAAGT CAACAAG CGG GTGGAAGACA 
TCTTGAGCGG GCTCCCACCG TCOCACCAGT 
TTTTCTGCCT GAGACTCCAC CCCACCAGTC 
CAGAGATGGG AATCTCAGGA GAATTGAGGT 
GTCCCACCCT GTTTGATGAG GGAGTGGAGA 
CAGACTTGAT CCTGCTACAG TAGGTGGATG 
ACTGCCAACA AGGTACTCGG GCCCTGTTAC 
CGGCCAAGAA AGCCCAAATT TGGGAGAAAG 
AGGGTCAGAG ATGGCTGACT GAGGCCAGAA 
AGAGGGGTGG AGAAGTAAGG GAGTTCCTAG 
CTGGGTTTGC AGAAATGGCA GGCGGGTTGT 
ATTGGGGCCC AGACCAACAA AAGGCCTATC 




GACA 3AAAGA GG ATAGACAG ggaggagaag 2 100 

3T3GGTA2T3 GAAAGAAAA3 Jj3CACT3GG .:i- : .o 

aagagg:ga3 AGGTGGGTGG 2 2 2) 

AGGAGCCCCC GGGTGAAGGG AGGATAACCC 22 8 0 

TGGTAGATAG TGGGGGGGAA CACTCCGTGC 2^4 0 

A j TGTGGGTG GGTG 2AAGGG GGTAGTGGAG 2 -MO 

AAGTAGATGT A3GTAG:GGT AAGGTGAGGG 24 '.0 

ATCCTCTGTT AGGAAGAGAT TTGGTGAGTA 2:^20 

GAGGAGGTGA GGTTATGGGA GGAATGGGGG 2^>*0 

AAGATGAGGA TGG 3GTAGAT GAGAGGTGAA 2 64 0 

GGGTGTGTGA TTTTGGTGAG GGGTGGGGGG 27 00 

AAGCTCCTCT GATCATACCT GTGAAAGGAA 21**0 

OCATGTCACA AGAAGGGAGA GTGGGGATGA 282 0 

GAATAGTGGT AGGGTGGGAG TGGGGGTGGA 288 0 

GGAGTAATGA TTATAGGCCT GTCCAGGATC 2y4 0 

TCCACCCCAC CGTGCCCAAC CCTTACAACC 3 000 

GGTAGAGTGT GGTTGATTTA AAGGATGCCT 3 0 6 0 

AGCCTCTCTT CGCCTTTGAG TGGAGAGATC 312 0 

GGAGGAGACT CCCACAGGGT TTCAAAAACA 318 0 

GAGACCTAGC AGACTTGCGG ATCCAGCACC 324 0 

AGTTAGTGGT GGCCGCCACT TGTGAGGTAG 3300 

AAACCCTAGG GAACCTCGGG TATCGGGCCT 33 6 0 

AGGTCAAGTA TCTGGGGTAT GTTGTAAAAG 342 0 

AAGAGAGTGT GATGGGGGAG GGTAGTGGGA 34 8 0 

GGAGGGGAGG CTTCTGTCGC CTCTGGATCC 3 54 0 

AGGGTGTGAC GAAAAGGGGG ACTCTGTTTA 36 00 

AAGAAATGAA GCAAGCTCTT GTAACTGCGG 366 0 




cagccctggg gttgcca3at ttgactaagc 
g:ta:gggaa aggtgt:gta a:3gaaaaa: 
tgtggaaaaa ggtagaggga gtag "aggtg 
ggattggggt actgacaaag gatggaggga 
tggc02cc2a tggagtagag ggagtagtga 
gcgggatgag t tagtatgag gg 3ttggttt 
tggtaggggt gaaggggggt acgctgctcc 
gggttgatat ggtgggggaa g :g gag gg aa 
tagaggggga ggagaggtg5 tagagggatg 

AGGCGGGAGC TGGGGTGAGG AGGGAGAGGG 
GGACATCCGG TGAGGGGGGT GAAGTGATAG 
GTAAGAAGGT AAATGTTTAT AGTGATAGGG 
GAGAAATATA GAGAAGGGGT GGGTTGGTGA 
AGGAGATGTT GGGGGTAGTA AAAGCCCTCT 
GTCCAGGACA TCAAAAGGGA CACAGCGCCG 
GGGGGGGAAA GGGAGGGATG AGAGAGAGTG 
CATCACCCTA CACCTCAGAA CATTTTCATT 
AGTTGGGGGC CATTTATGAT AAAAGAAAGA 
TGCCTGACCA GTTTAGTTTT GAATTATTAG 
TGTGAAAAAT GAAGGGTGTG GTAGAGAGAA 
ATGGAACAGT CAAAAATATC ACTGAGACCT 
AGTCTGCCGT TAAACAGGGA AGTAGGGTGG 
TCGATTTCAC GGAGATAAAG GG GGGATTGT 
ATAGGTTTTG TGGGTGGATA GAAGCCTTCC 
GGAAGAAGGT ACTAGAGGAG ATGTTCGGGA 
ACAATGGGCC TGCCTTCGTO TCGAAGGTGA 
ATTGGAAATT ACATTGTGGA TACAGAGCCC 




G:TTTGAAGT GTTTGTG3AG GAGAAGGAGG 3 72 0 

t^gga:gttg ^cgt :ggc:g 3T3gggtagg 3 7-n 

GGTGGCCCCC TTG:GTAGGG ATGGTAGGAG 3 « 4 ■: ) 

AGCTAACCAT GGGAGAGGCA CTAGTCATTC 3 MO 

AA:AACCCCC CGACCGCT 2-2 GTTTGGAAGG 3 MO 

TGGAGAGGGA CCyGGTC^AG TTGG^AGGGG 4 ■.).:-) 

GAGTGGGTGA GGAAGGGGTG CAACACAACT 4(r-M 

CCCGACrr^A CCTAACGGAC CA2C'2GCTCC 4 14 0 

3AAGGAGTGT GTTA 1AAGAG GGAGA3GGTA 4 .MO 

AGGTAATGTG GGGTAAAGGG GTGGGAGGG 3 4 2 0 0 

CACTCACCCA GGGGGTAAAG ATGGCAGAAG 4 3;:0 

GTTATGGTTT TGCTACT 3CC CATATCCATG 4 3 80 

GATGAGAAGG CAAAGAGATC AAAAATAAAG 444 0 

TTGTGGGGAA AAGAGTTAGG ATAATGGATT 4 5 00 

AGGCTAGAGG GAAGGGGATG GCTGAGGAAG 4 50 0 

CAGACACCTC TACCCTCCTC ATAGAAAATT 4 b 2 0 

AGAGAGTGAG TGATATAAAG GACCTAACCA 468 0 

AGTATTGGGT CTACCAAGGA AAAGGTGTGA 4 74 0 

ACTTTCTTCA TGAGGTGAGT GAGGTGAGGT 4 8 00 

GCCACAGTCC GTAGTAGATG GTGAAGGGGG 4ft 6 0 

GGAAAGGTTG TGGAGAAGTG AAGGGGAGGA 4 0 2 0 

GGGGGGATGG GGGGGGGAGT GATTGGGAGA 4 'J HO 

ATGGCTATAA ATATCTTCTA GTTTTTATAG 5 04 0 

GAAGGAAGAA AGAAACGGGG AAGGTGGTAA 510 0 

GGTTCGGCAT GCCTCAGGTA TTGGGAACTG 510 0 

GTCAGACAGT GGGGGATGTG TTGGGGATTG 522 0 

AAAGGTGAGG GGAGGTAGAA AGAATGAATA 52 8 0 




gaac:atcaa ggagagttta actaaattaa 

tg3tagtgg: cttagccgtg taccgaggcc 

CAT AT' 5 AG AT GTTATATGGG G :ACCCCCGC 

GAAGAGTTAC TAACAGCCCC TCTCTCGAAG 

AGGAAGTCTG GAGACGTCTG GCG3CAG:CT 

CT GAGGCTTA CGGAGTGG 3 3 GA3A3A3T3T 

AAGCTGGCTG GAAAGGAGGT TACACAGTCG 
AGGGCATCGC AGCTTGGATA CACGCCGCCG 

GATCCTCTAG AGTGAGATGG CGCGTTCAAG 

CCCGCGAGGC CCCCTAATCC CCTTAATTCT 

GCCCGGCTCG AGTCCTGATG AAGTCTATAA 

GGAGAGGGTA TGGGCAAGTT CTGGCAACGA 

GCCAGATTTA TGTATGTTAG CCCACGATGG 

CGCTTTTTCT TCTCCGCCGG GGCGCCCTTG 

CAGAGACTGC GAAGAACCTT TAACCTCCCT 

ACTCAAGCTA GACCAGACAA CTCATAAATC 

CCACCGCCCC GGAGAATCCA AGTCATGTGG 

GGGCTGTGAG ACAACCGGTA GAGCTTAGTG 

AGTAAAGAAG AATCTCAGCT CTGACCAGGC 

CAACCCCTTA GTTATTCGGT TTACAGACGG 

ACATTACTGG GGCTTACGTT TGTATGTCTC 

CCGACTCAGA TACCAAAATC TAGGACCCCG 

AGACCAACAG CCACTCTCCA AGCCCAAACC 

CAGTGGGACT CCTCTCTCCC CTACCCAACT 

AAAGTTAGTA GACGGAGCCT ACCAAGCCCT 

GTGGTGGTTG TGTCTAGTAG CGGGACCCCC 

CTACTCCAAC CATACCTCTG CTCCAGCCAA 




CGCTTGCAAC TG3 3TGTAGA GACTGG 3TGG 53 4 0 

gcaacacgcg G3 3 3CGGG AT GGCCTCACCG 54 0 0 

CCCTTGTAAA CTTCCCTGAC CCTGACATGA 54 0 0 

CTCACTTAGA GGCTCTGTAG TTAGTCCAGC 55 2 0 

AGGAAGAACA ACTGGAGGGA GG 3GTGGTAG 55 4 0 

■3^TCCGZOZ AGATGAGAGT AAGAACCTAG 56 4 0 

TGCTGACCAC GGG :ACCGCC GTGAAAGTAG 5700 

AGGTGAAGGG T 3GG Z,AC ZC-2 GGGGGTGGAG 570 0 

GCTCTCAAAA CCCCTTAAAA ATAAGGTTAA 5 8 JO 

TGTGATGGTC AGAGGGGTCA GTAGTGCTTG 53*0 

TATCACCTGG GAGGTAAGGA ATGGAGATGG 5 34 0 

GGGTGTGTGG AGGTGGTGGG CTGACCTTAC 6 0 0 0 

ACCATCTTAT TGGGGGCTAG AATATGAATG 6 00 0 

TTGCTCAGGG GGGAGCAGGG GAGGGTGTTG 012 0 

CACCCCTCGG TGGAAGAGTG GGTGGAACAG 618 0 

AAATGAGGGA TTTTATGTTT GCCCCGGGCC 624 0 

GGGTCCAGAC TGGTTCTAGT GTGCCTATTG 6 3 00 

GAAGGCGTCC TCATCATGGG ATTTGATCAG 6 30 0 

TGTCCAGGTA TGGAAAGATA ATAAGTGGTG 642 0 

CGGGAGAGGG GTTACTTCCT GGACCACAGG 64 8 0 

GGGAGAAGAT CCAGGGCTTA CATTTGGGAT 6 54 0 

CGTCCCAATA GGGCCAAACC CCGTTCTGGC 66 00 

TGTTAAGTGG GGTTGAGTGA CCAAACCACC 6 66 0 

TCCACCGGGG GGAAGGGAAA ATAGGGTGGT 6 72 0 

CAACCTCACC AGTCCTGACA AAACCGAAGA 6 78 0 

CTAGTAGGAA GGGGTTGCCG TCCTGGGTAC 6 84 0 

CTGCTGGGTG GCCTCCCAAC ACAAGTTGAC 6 90 0 



CCTGTCCGAA GTGACC3GAC AGGGACTCTG CATAGGAGCA GTTCGCAAAA CACATGAGGC 6'? 6 0 

cctat:;taat accacccaga caagcagtcg agggtcgtat tatctagttg cccctacagg 7o::o 

TACGATGTGG GGTTGTAGTA CCGGGCTTAC TCCATGCATC TCCACCACCA TACTGAACCT 7i"ih0 

TACCACTGAT TATTGTGTTC TTGTCGAACT CTGGCCAAGA GTCACCTATC ATTCCGGCAG 714 0 

CTATGTTTAC GGCCTGTTTG AGAGATCCAA CCGACACAAA AGAGAAGCGG TGTCGTTAAG 72 00 

CCTGGCCCTA TTATTGGGTG GACTAACCAT GGGGGGAATT GGGGGTGGAA TAGGAACAGG 72 6 0 

GACTACTGCT CTAATGGCCA CTCAGCAATT GGAGGAGCTG CAAGCCGCAG TACAGGATGA 73 2 0 

TGTGAGGGAG GTTGAAAAAT CAATCTCTAA CCTAGAAAA3 TCTCTCACTT CCCTGTCTGA 73*0 

AGTTGTCCTA CAGAATCGAA GGGGCCTAGA CTTGTTATTT CTAAAAGAAG GAGGGGTGTG 744 0 

TGCTGCTCTA AAAGAAGAAT GTTGCTTCTA TGCGGACCAC ACAGGACTAG TGAGAGACAG 7b 0 0 

CATGGCCAAA TTGAGAGAGA GGCTTAATCA GAG AC AG AAA CTGTTTGAGT CAACTCAAGG 7 5*0 

ATGGTTTGAG GGACTGTTTA ACAGATCCCC TTGGTTTACC ACCTTGATAT CTACCATTAT 76 2 0 

GGGACCCCTC ATTGTACTCC TAATGATTTT GCTCTTCGGA CCCTGCATTC TTAATCGATT 76*0 

AGTCCAATTT GTTAAAGACA GGATATCAGT GGTCCAGGCT CTAGTTTTGA CTCAACAATA 774 0 

TCACCAGCTG AAGCCTATAG AGTACGAGCC ATAGATAAAA TAAAAGATTT TATTTAGTCT 78 0 0 

C C AG AAAAAG GGGGGAATGA AAGACCCCAC CTGTAGGTTT GGCAAGCTAG CTTAAGTAAC 78 6 0 

GCCATTTTGC AAGGCATGGA AAAATACATA ACTGAGAATA GAGAAGTTCA GATCAAGGTC 7 92 0 

AGGAACAGAT GGAACAGCTG AATATGGGCC AAACAGGATA TCTGTGGTAA GCAGTTCCTG 798 0 

CCCCGGCTCA GGGCCAAGAA CAGATGGAAC AGCTGAATAT GGGCCAAACA GGATATCTGT 804 0 

GGTAAGCAGT TCCTGCCCCG GCTCAGGGCC AAGAACAGAT GGTCCCCAGA TGCGGTCCAG 8100 

CCCTCAGCAG TTTCTAGAGA AC CATC AG AT GTTTCCAGGG TGCCCCAAGG ACCTGAAATG 816 0 

ACCCTGTGCC TTATTTGAAC TAACCAATCA GTTCGCTTCT CGCTTCTGTT CGCGCGCTTC 82 2 0 

TGCTCCCCGA GCTCAATAAA AGAGCCCACA ACCCCTCACT CGGGGCGCCA GTCCTCCGAT 82 8 0 

TGACTGAGTC GCCCGGGTAC CCGTGTATCC AATAAACCCT CTTGCAGTTG CA 8 3 32 

<2) INFORMATION FOR SEQ ID NO: 16: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 



(D) TOPOLIGY : linear 
ni) MOLECULE TYPE: DNA [ genomic) 
(xi) SEQUENCE DES 2F.I FTI jN : SEQ ID NO:16 
GGGAGTGGTA ACAGTCTGGC ■ 'TTAATTOTO AG 

(2) IN FORMAT ION FOR 3E V > ID NO: 17: 

; i ) sequence ::hara:ter:stics : 

(A) LENGTH: .:3 base pairs 
(B> TYPE: nu-vLeic acid 
(0) STRANDEDNESS : single 

f D ) TOPOLOGY : 1 in-:- hr 
(ii) MOLECULE TYPE: DNA (genomic) 
{xi) SEQUENCE DES JF. I F TION : SEC ID NO: 17 
CGGTCGACCT CGAGAATTAA TTC 

1 2 ) INFORMATION FOR SE'J ID NO: 13: 

(i i sequence :haracteristics : 

i A) LENGTH: 2 3 base pairs 

( B ) TYPE: nucLeic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(iii MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 8 
CTGGGAGACG TCCCAGGGAC TTC 

(2) INFORMATION FOR SEO ID MO : 1 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: <2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 9 
GGCCAGACTG TTACCACTCC CTGAAGTTTG AC 

(2) INFORMATION FOR SEQ ID NO : 2 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 0 
CATCGATAAA ATAAAAGATT TTATTTAGTC 

(2) INFORMATION FOR SEQ ID NO : 2 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 1 
CAAATGAAAG ACCCCCGCTG AC 




(2) INFORMATION FOR SEQ ID NO:.':2: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: O base pairs 
■ ?.) TYPE: nu Oeic ac:J 
<C> STRANDEDNESS: single 
■CO TOPOLJiY: linear 
(li) MOLECULE TYPE: DNA (gencmic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID N2:22: 
GAAGCTTCTC CCAGAACOCA • .'OAGTCTTGA AACGCCATC 39 

(I!) INFORMATION FOR SEJ ID NO : 2 3 : 
(i) SEQUENCE CHARACTERISTICS: 
i A) LENGTH: 25 base r:>airs 



(2) INF 2RMATION FOR SEQ ID NO:J4: 
(i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: .V 6 base pairs 

(B) TYPE: nucleic acid 
iC) STRANDEDNSSS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CTCCTCGAGC TAAAGATATT TTAGAGAAGA ATTAAC 3 6 

(2) INFORMATION FOR SEQ ID NO:25: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : DNA (genomic) 
(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 15: 
TTCCTCTGGA CAGCTGTCTA CTTTG 2 5 

(2) INFORMATION FOR SEQ) ID NO:26: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 51 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
TCGAGGGGCC CAGATCTGCG GCCGCTCGCG AGTCGACAAG CTTGGATCCA T 51 



( E ) TYPE: nu-leic ac i i 
i C) STRAN 2> E 2>N E S S : s ingle 
fD) TOPOLOGY: linear 



(ii) MOLE 2ULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID 



NO: 2 3 : 



GCACCAGCTT TTGGTCTCAT OAAAG 



25 



(2) INFORMATICS FOR SEn ID NO: 27: 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 49 base pairs 

(B) TYPE: nu: Leic red 

(C) STRANDEDNESS: .single 
i D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: CEO ID NO: 27: 
CGATGGATCC AAGCTTGTC3 ACTCG IJAG I 1 GGCCGCAGAT CTGGGCCCC 4 3 



(2) INFORMATION FOR SEQ ID NO: 2-3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nuvleic acid 

(C) STRANDED:; ESS : single 

(D) TOPI'LOjY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CACCGTCGTC GACTTATGCT 2 0 



(2) INFORMATION FOR SEQ ID HO i I'd: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: <4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 9 : 
GACCGTCGAC TCAATTCTGG GAGAAGCTTO TTGG 3 4 



(2) INFORMATION FOR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 0 : 
CACCGTCGTC GACTTATGCT 2 0 



(2) INFORMATION FOR SEQ ID NO : 3 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: s 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 1 : 
CAACGCTCGA GAAGCAGAAT CGCAAAAGGC 3 0 



(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

ixii SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

t:sgctccaj • o\c ;aa 1 • ..; :;aaataa:t2 _;t 



32 



(2) INFORMATION FOR SEQ ID NO: 33: 
(i) SEQUENCE CHARACTERISTICS: 
■A) LENGTH: 3 1 base pair's 
| BO TYPE: nu-Oeic hcid 
t C) STRANDSDNESS : single 
iD) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA f genomic) 

(xi) sequence description: seo id nd:33: 
ccga:tcgag tcagtagagg tcctgtgcct c 31 

(2) information for seq id no: 34: 
(i) sequence chara 3ter i st ics : 

(A) LENGTH: 2 6 base pairs 

(&) TYPE: nu:Lei: acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEO ID NO: 34: 
GCGAOTCGAG CATGGGGCCC CGGGGC 2 6 

(2) INFORMATION FOR SEQ ID NO : 3 5 : 
{iJ SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 1 base pairs 

(E) TYPE: nucleic acid 

(C) STRANDSDNESS: single 

(D) TOPOLOGY: linear 

(ii) MC'LECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GCACTGGAAT TCGTCAGGGC G 21 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS: single 

(D) TOPOLOGY : linear 

(ii) MC'LECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
CGCGCCGCTC GAGTCTACAA TGGCCTTGAC CTTTGCTTTA CTGG 4 4 

(2) INFORMATION FOR SEQ ID NO: 37: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
{ B) TYPE: nucleic acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
GCGCCCATCG ATTCATTOCT TACTTCTTAA ACTTTCTTGC AAG 4 3 



CM INFORMATION FOR SEQ ID NO : 3 8 : 
(l) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 7 base pairs 
(?) TYPE: nu;Tei i' acid 
( C STRANDEDNESS : single 
(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 8 : 
CGCGCCGCTC GAGCATCCAA TGGCCCTGTC CTTTTCTTTA CTTATGG 



(2> INFORMATION FOR SEC ID NO : 3 3 : 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 9 base pairs 
(EM TYPE: nucleic acid 
CM STRAND EDN ESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 9 : 
CGATCGATTC AATCOTTCCT CCTTAATCTT TTTTGCAAG 

CM INFORMATION FOR SEQ ID NO : 4 0 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION; SEQ ID NO : 4 0 : 
CCGGATCCTC TACAATGGCC TTGACCTTTG CTTTACTGG 

CM INFORMATION FOR SEQ ID NO : 4 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 1 : 
CGCGCCGGCG GCCGCTCATT CCTTACTTCT TAAACTTTCT TGCAAG 

(2) INFORMATION FOR SEQ ID NO: 42: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 2 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CCGGATCCCA TCCAATGGCC CTGTCCTTTT CTTTACTTAT GG 

(2) INFORMATION FOR SEQ ID NO : 4 3 : 
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STKANDEDNESS : single 

(D) TOPOLOCY: linear 

(ii) MOLECULE TYPS: DNA (.genomic) 
(xi) SEQUENCE DESCRIPTION: 3EQ ID NO : 4 3 : 

CGCGCCGGCG GCCGCTCAAT CCTTCCTCCT TAATCTTTTT TGCAAG 4 6 

(2) INFORMATION FOR SEQ ID NO: 44: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9080 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 4 : 

GGGGGGGGGG GGGGGGGGGG GGGTGAGCAC ATCCAGTGGG TAAAGTTCCT TAAAATGCTC 6 0 

TGCAAAGAAA TTGGGACTTT TCATTAAATC AGAAATTTTA CTTTTTTCCC CTCCTGGGAG ICO 

CTAAAGATAT TTTAGAGAAG AATTAACCTT TTGCTTCTCC AGTTGAACAT TTGTAGCAAT 1m 0 

AAGTCATGCA AATAGAGCTC TCCACCTGCT TCTTTCTGTG CCTTTTGCGA TTCTGCTTTA 24 0 

GTGCCACCAG AAGATACTAC CTGGGTGCAG TGGAACTGTC ATGGGACTAT ATGCAAAGTG 3 00 

ATCTCGGTGA GCTGCCTGTG GACGCAAGAT TTCCTCCTAG AGTGCCAAAA TCTTTTCCAT 36 0 

TCAAOAOCTG AGTCGTGTAC AAAAAGACTC TGTTTGTAGA ATTCACGGAT CACCTTTTCA 4 JO 

ACATCGCTAA GCCAAGGCCA CCCTGGATGG GTCTGCTAGG TCCTACCATC CAGGCTGAGG 480 

TTTATGATAC AGTGGTCATT ACACTTAAGA ACATGGCTTC CCATCCTGTC AGTCTTCATG 54 0 

CTGTTGGTGT ATCCTACTGG AAAGCTTCTG AGGGAGCTGA ATATGATGAT CAGACCAGTC 600 

AAAGGGAGAA AGAAGATGAT AAAGTCTTCC CTGGTGGAAG CCATACATAT GTCTGGCAGG 6*0 

TCCTGAAAGA GAATGGTCCA ATGGCCTCTG ACCCACTGTG CCTTACCTAC TCATATCTTT 72 0 

CTCATGTGGA CCTGGTAAAA GACTTGAATT CAGGCCTCAT TGGAGCCCTA CTAGTATGTA 78 0 

GAGAAGGGAG TCTGGCCAAG GAAAAGACAC AGACCTTGCA CAAATTTATA CTACTTTTTG 84 0 

CTGTATTTGA TGAAGGGAAA AGTTGGCACT CAGAAACAAA GAACTCCTTG ATGCAGGATA 900 

GGGATGCTGC ATCTGCTCGG GCCTGGCCTA AAATGCACAC AGTCAATGGT TATGTAAACA 96 0 

GGTCTCTGCC AGGTCTGATT GGATGCCACA GGAAATCAGT CTATTGGCAT GTGATTGGAA 102 0 

TGGGCACCAC TCCTGAAGTG CACTCAATAT TCCTCGAAGG TCACACATTT CTTGTGAGGA 108 0 

ACCATCGCCA GGCGTCCTTG GAAATCTCGC CAATAACTTT CCTTACTGCT CAAACACTCT 114 0 

TGATGGACCT TGGACAGTTT CTACTGTTTT GTCATATCTC TTCCCACCAA CATGATGGCA 12 0 0 

TGGAAGCTTA TGTCAAAGTA GACAGCTGTC CAGAGGAACC CCAACTACGA ATGAAAAATA 12 6 0 

ATGAAGAAGC GGAAGACTAT GATGATGATC TTACTGATTC TGAAATGGAT GTGGTCAGGT 132 0 

TTGATGATGA CAACTCTCCT TCCTTTATCC AAATTCGCTC AGTTGCCAAG AAGCATCCTA 138 0 

AAACTTGGGT ACATTACATT GCTGCTGAAG AGGAGGACTG GGACTATGCT CCCTTAGTCC 14 4 0 

TCGCCCCCGA TGACAGAAGT TATAAAAGTC AATATTTGAA CAATGGCCCT CAGCGGATTG 15 0 0 

GTAGGAAGTA CAAAAAAGTC CGATTTATGG CATACACAGA TGAAACCTTT AAGACTCGTG 156 0 

AAGCTATTCA GCATGAATCA GGAATCTTGG GACCTTTACT TTATGGGGAA GTTGGAGACA 162 0 

CACTGTTGAT TATATTTAAG AATCAAGCAA GC AG AC CAT A TAACATCTAC CCTCACGGAA 168 0 

TCACTGATGT CCGTCCTTTG TATTCAAGGA G ATT AC C AAA AGGTGTAAAA CATTTGAAGG 174 0 

ATTTTCCAAT TCTGCCAGGA GAAATATTCA AATATAAATG GACAGTGACT GTAGAAGATG 18 0 0 

GGCCAACTAA ATCAGATCCT CGGTGCCTGA CCCGCTATTA CTCTAGTTTC GTTAATATGG 186 0 

AGAGAGATCT AGCTTCAGGA CTCATTGGCC CTCTCCTCAT CTGCTACAAA GAATCTGTAG 192 0 

ATCAAAGAGG AAACCAGATA ATGTCAGACA AGAGGAATGT CATCCTGTTT TCTGTATTTG 198 0 

ATGAGAACCG AAGCTGGTAC CTCACAGAGA ATATACAACG CTTTCTCCCC AATCCAGCTG 2 04 0 

GAGTGCAGCT TGAGGATCCA GAGTTCCAAG CCTCCAACAT CATGCACAGC ATCAATGGCT 210 0 

ATGTTTTTGA TAGTTTGCAG TTGTCAGTTT GTTTGCATGA GGTGGCATAC TGGTACATTC 216 0 

TAAGCATTGG AGCACAGACT GACTTCCTTT CTGTCTTCTT CTCTGGATAT ACCTTCAAAC 2 22 0 

ACAAAATGGT CTATGAAGAC ACACTCACCC TATTCCCATT CTCAGGAGAA ACTGTCTTCA 22 8 0 




tgt3gatgga aaacgcaggt ctatggattc 
gag jCatgac CjC:tta.:tg aaggtttgta 
ag3aga3tta tgaa jAtatt t:aggatagt 
3aaggttgt : :ca 3aattga a 3aga 3ggta 
haattc :aga aaatgagata gagaagagtg 
gtaaaataoa aaatg pgtgg t 3tag pgatt 
gagatggggt atggttatgt gatctccaag 
gatgaggtgg ag 3aatagag agtaataaca 
agctccatca gagtggggag atggtattta 
atgagaaagt gg3gagaa3t gga3gaa3a3 
gtagatgaaa taatctgatt t':aacaattc 
atagaagttg gttaggag 2-2 ggaagtatgg 
gtgtatttgg gaaaaagtga tgtggggtta 
aa3aaaataa pgattgaaag tt3tta3aat 
ggggaaaaaa tgtatggtga agagagagtg 
gaggt 3gttt gttgactaaa gataatggg p 
gaaagaaaag ttggaataat tcagcaacta 
tattaattga gaatagtcga tgagtgtgg 2 
aaaaagtgag aggtttgatt catgacagaa 
gggtaaatga tatgtcaaat aaaagtagtt 
aaaaagaggg ccccattcca ggagatggag 
tattottgcc agaatgagga aggtggatac 
gtggggaagg ccccagtcca aaggaattag 
gtgagaattt gttgtgtgag aaaaagaaa 3 
acgtaggact caaagagatg gtttttccaa 
ataatttaga tgaaaataat agagagaatg 
agaaggaaag attaatccaa gagaatgtag 
gtaagaattt gatgaagaag gttttgttag 
atgacggggg atatgctcca gtacttcaag 
gaagaaagaa acacacagct gatttgtgaa 
tgggaaatiia aaccaagcaa attgtagaga 
atacaagcca gcagaatttt gtcacgcaac 
tgggagtaga agaaacagaa cttgaaaaaa 
ggtggaaaaa catgaaacat ttgaggggga 
aggagaaagg ggcgattagt cagtctccct 
tgggtgaagg aaatagatct ggattaggga 
gag c tat at a tgtgaggagg gtcctattcc 
gttatagaaa gaaagattgt ggggtggaag 
aaaataaggt ttctttagcc attgtaaggt 
gctccctggg gagaagtggg agaaattgag 
tcgggaaagg agacttgccc aaaagatgtg 
tttatgagaa ggaggtattg ggtagggaaa 

TGGTGGAAGG GAGGGTTGTT GAGGGAAGAG 
GAGGTGGAAA AGTTCCCTTT CTGAGAGTAG 
AGGTATTGGA TGGTGTTGGT TGGGATAACC 
GGAAATCCCA AGAGAAGTCA GG AG AAAAAA 
GGGTGAAGGG TTGTGAAAGG AATGATGGAA 
GGGAAATAGA AGTCACCTGG GGAAAGGAAG 
GAGGAGTGTT GAAAGGGGAT GAAGGGGAAA 
AGGAAATTGA CTATGATGAT ACCATATCAG 
ATGATGAGGA TGAAAATGAG AGCCCCCGCA 
TTGCTGCAGT GGAGAGGGTG TGGGATTATG 
AGAGGGGTGA GAGTGGGAGT GTCCCTCAGT 
ATGGCTGGTT TACTGAGGGG TTATAGGGTG 




tgg3gtg:::a :aagtga3ag tttcggaaca 2-40 

GTT 2 VGA 3AA iAAGA- T 3GT GATTA PTA 2 2 \ \ 4 0 0 

t jgt jA 3taa aaa:aatg:g att^aaccaa d 
ggagta j'jCa aaaggaattt aatg::a::a 
agggttggtt tgcagacaga agaggtatgg 

t jTtgatgct gtt jGGagag agtggtagtg :: ». 4 o 

aagccaaata tgagagtttt tgtgat3atg 2 70 0 

gg:tgtgtga aapga:agag ttgagggga: 2 7ko 

ggggtgagtg aggggtggaa ttaagattaa 2^g:;i 

agtt5aagaa agttgattt: AAAGTTTGTA 28 
gat:agagaa tttgggagga ggtagtgata i 

:agttgatta tgatagtgaa ttagata:ga a 
gtgagtgtgg tggacgtctg aggttgagtg 

3aggttpaat gaafagggaa gaaa3ttgat 3 12'.> 

GTAGGTTATT TAAAGGGAAA AGAGGTGATG :um:i 

TATTGAAA3T TAGGATGTGT TTGTTAAAGA 3G:i 

ATAGAAAGAG TGAGATTGAT GGCCCATCAT 3 300 

AAAATATATT AGAAAGTGAG AGT3A3TTTA ;s3*fu 

TGCTTATGGA CAAAAATGCT AGAGCTTTGA 342 u 

G AT G AAAAAA CATGGAAATG GTG 3AAGAGA ;wsi' 

AAAATGGAGA TATGTCGTTC TTTAAGATGC 3 54 0 

AAAGGAGTCA TGGAAAGAAG TGTGTGAA3T 3 600 

TATGGTTAGG AGGAGAAAAA TGTGTGGAAG 3 66 0 

TGGTAGTAGG AAAGGGT 3AA TTTACAAAGG 3 73 0 

GGAGGAGAAA CCTATTTCTT ACTAACTTGG 3 78 0 

AAGAAAAAAA AATTGAGGAA GAAATAGAAA 3 84 i.) 

TTTTGCCTCA GATAGATAGA GTGACTGGCA 3 '3 0 0 

TGAGGAGTAG GGAAAATGTA GAAGGTTCAT 3^6 0 

ATTTTAGGTC ATTAAATGAT TGAAGAAATA 4 03 0 

AAAAAGGGGA GGAAGAAAAG TTGGAAGG3T 4 08 0 

AATATGGATG GAGGAGAAGG ATATGTGGTA 4 14 0 

GTAGTAAGAG AGGTTTGAAA GAATTGAGAG 42 00 

GGATAATTGT GGATGACAGG TGAAGGGAGT 42 6 0 

GGAGGGTGAG AGAGATAGAG TAGAATGAGA 4 32 0 

TATGAGATTG CCTTACGAGG AGTCATAGCA 4 38 0 

TTGGAAAGGT ATCATCATTT GGATGTATTA 444 0 

AAGAGAAGTG TTGTGATGTT GGAGGAGGAT 4 5 00 

AAAGGAGTGA TTTGTTAGAA GGAGGGAAAA 4 56 0 

TGGAGATGAG TGGTGATGAA AGAGAGGTTG 4 62 0 

TGAGATACAA GAAAGTTGAG AAGAGTGTTG 4 6 8 0 

GCAAAGTTGA ATTGGTTGCA AAAGTTGAGA 4 74 0 

CTAGCAATGG GTGTGGTGGG GATGTGGATG 4 8 00 

AGGGAGGGAT TAAGTGGAAT GAAGGAAAGA 4 860 

GAAGAGAAAG CTCTGCAAAG ACTCCCTCCA 4 920 

ACTATGGTAC TGAGATAGCA AAAGAAGAGT 4 9 80 

CAGCTTTTAA GAAAAAGGAT ACCATTTTGT 504 0 

TAGCAGCAAT AAATGAGGGA GAAAATAAGG 5100 

GTAGGACTGA AA(3GGTGTGG TGTGAAAAGG 516 0 

TAACTGGTAC TACTCTTCAG TGAGATGAAG 52 2 0 

TTGAAATGAA GAAGGAAGAT TTTGACATTT 521 8 C' 

GGTTTGAAAA GAAAAGAGGA GAGTATTTTA 53 4 0 

GGATGAGTAG CTGGGGAGAT GTTGTAAGAA 54 00 

TCAAGAAAGT TGTTTTGGAG GAATTTAGTG 54 6 0 

GAGAAGTAAA TGAACATTTG GGACTCCTGG 5 52 0 




AT AT AT AAGA j 2A 3AA 3TTGAAGATA AT AT CAT G ^T AACTTTCAGA AAT:AGj2GT 55H0 

■jt'JjTCCCta ttcgttctat tctagc;:tta tttgttatia ggaagatcag ag^gaAjGag e.^-;o 

cagaac:taj aaaaaacttt 3tcaa:3j::ta atgaaa:2aa aa:tta:ttt t:maaa gtgc ecmo 

AA2ATCATAT GGCACCCAGT AAAGATGAGT TT3ACTG2AA A jCCTGGGCT TATTT:TCTJ E.^D 

atgttgag:t ggaaaaaoat gtgcactcag gcgfgattgg acccjttctg gtgtg::aga 5^0) 

ctaacacact gaacgctgct gatgg ^agag aagtgagagt acaggaattt gctgtgtttt 58h:) 

TZAOCATGTT TGATGAGACC AAAAGCTG jT AGTTCACTGA AAATATGGAA A 5AAAGTGGA E.-*4:l 

GG5CTCCCTG CAATATCGAG ATGGAAGATC CCATrTTTAA AGAGAATTAT CGCTTCCATG •;.!) ■:):.! 

CAATCAATj3 GTAGAT.AATG GATAGACTAC GTGGGTTA jT AATGGCTCAG GATCAAA GGA >G>, j 

TTGGAT jGTA TGTGGTGAGG ATGGGCAGCA ATGAAAAGAT GGATTGTATT CATTTCAGTG <:12i) 

ga:at3tgtt ca:tgtac^a aaaaaaiagg agtataaaat gggagtgtag aatztctatc ^i-.o 

CAGGTGTTTT TGAGAGAGTG GAAAFGTTAC GATGGAAAGG TGGAATTTGG GGGGTGGAAT ^24m 

3CCTTATTGG CGAGCATCTA CATGIT3GGA T3AGCACACT TTTTCTGGTG TAGAGGAATA 6 ; ; • 0 

AGTGTGAGAG TCCCCTGGGA ATGGGTTGTG GAGAGATTAG AGATTTTGAG ATTAGAGGTT 6 

CAGGACAATA TGGAGAGTGG GGGGGAAAGG TG3CCAGACT TCATTATTC 2 GGATGAAT 2 A 64 2 0 

ATGCGTGGAG CACCAAGGAG CCCTTTTITT GGATCAAGGT GGATGTGTTG GCACCAATGA 6 4 Mil 

TTATTGAGGG CATCAAGACC CAGGGTG :GG GTGAGAAGTT GTGGAGGGTG TAIIATCTCT: 6b4'.i 

AGTTTATCAT CATGTATAGT CTTGATGGGA AGAAGTGGGA GACTTAT2GA GGAAATTGCA 66; in 

CTGGAACCTT AATGGTGTTG TTTGGGAATG TGGATTCATC TGGGATAAAA CACAATATTT 666 i.i 

TTAACCCTCG AATTATTGGT CGATAGATGC GTTTGCACCC AAGTCATTAT AGGATTGGGA 6 72 0 

GGAGTGTTGG CATGGAGTTG ATGGGCTGTG ATTTAAATAG TTGCAGCATG CCATTGGGAA 6 78 0 

TGGAGAGTAA AGGAATATGA GATGG AG AG A TTACTGCTTC ATGGTAGTTT ACGAATATGT 6840 

TTGGGAGGTG GTCTCCTTCA AAAGGTGGAG TTCACCTGCA AGGGAGGAGT AATGCCTGGA 6 9 00 

GAGCTCAGGT GAATAATCCA AAAGAGTGGC TGG.AAGTGGA GTTGGAGAAG AG.AATGAAAG 6 96 0 

TGAGAGGAGT AAGTAGTGAG GGAGTAAAAT GTGTGGTTAG CAGCATGTAT GTGAAGGAGT 7 02 m 

TGGTGATGTG CAGGAGTGAA GATGG GG AT G AGTGGACTCT CTTTTTTCAG AATGGGAAAG 7 0*f> 

TAAAGGTTTT TCAGGGAAAT CAAGACTCCT TGAGAGCTGT GGTGAAGTCT GTAGAGGGAG 7 14 0 

GGTTAGTGAG TGGGTAGGTT GGAATTGACG CCCAGAGTTG GGTGGACGAG ATTGCCCTGA 72 00 

GGATGGAGGT TGTGGGGTGG GAGGGAGAGG ACCTCTACTG AGGGTGGCCA GTGGAGGAGG 72 6 0 

TGCCACTGCC GTGAGGTGTG CCTCCTCAGC TCCAGGGCAG TGTCCCTCCC TGGCTTGCCT 73 2 0 

TCTACCTTTG TGGTAAATGG TAGCAGACAC TGOCTTGAAG CCTCCTGAAT TAAGTATGAT 7 3 3 0 

CAGTGGTGGA TTTCTTTGGT GGGGGGGCAG GAGGGTGGAT GCAATTTAAG TTAACTCTTA 74 4 0 

CCTATTTTCT GCAGCTGCTC GGAGATTAGT CCTTCCTTCC AATATAAGTA GGGAAAAAGA 75 0 0 

AGTGAGGAGA AAGGTGGATG AAAGGATTGT TGGGTGAAAA GTTAGGCCTC TCAGAGTCAC 756 0 

CACTTGCTCT GTTGTAGAAA AACTATGTGA TGAAACTTTG AAAAAGATAT TTATGATGTT 7 62 0 

AACATTTCAG GTTAAGCCTC ATACGTTTAA AATAAAACTC TGAGTTGTTT ATTATCCTGA 7 68 0 

TGAAG GATGG AAGAAAGGAT GTTTCAGGAT CAGATGAATA CAATCTTGGA GTCAAAAGGC 7 74 0 

AAATGATTTG GAGAATGTGG AAAATGGAGA GAATACAATA ACTACTACAG TAAAGTCTGT 78 0 0 

TTGTGGTTGG TTACACATAG ATATAATTAT GTTATTTAGT CATTATGAGG GGCACATTCT 7 86 0 

TATCTGCAAA AGTAGGATTG TTAAACTGAG AATTATAGAT GGGGTTCAAG AATGGGTAAG 7 92 0 

TCCCCTGAAA TTATATAAGG CATTCTGTAT AAATGCAAAT GTGGATTTTT CTGACGAGTG 7 98 0 

TG GAT AG AT A TAAAGCCATT TGGTCTTAAT TCTGACCAAT AAAAAAATAA GTCAGGAGGA 8 04 0 

TGCAATTGTT GAAAGGTTTG AAATAAAATA ACAATGTCTT GTTGAAATTT GTGATGGCGA 810 0 

AGAAAGAAAA TGATGATGAC ATTAGGGTTC TAAAGGAGAT ACATTTAATA TTTCTGTGGA 816 0 

AATATGAGGA AAATCCATGG TTATCTGAGA TAGGAGATAG AAACTTTGTA ATTGTAATAA 8 22 0 

TGCACTCAGT TTAGTGTGTG CCTCTACTAA TTTGGTGCTG AAAATAACAC AAGAAAAATG 82*0 

TAACAGGGGA AATTATATAC CGTGACTGAA AACTAGAGTC GTAGTTAGAT AGTTGAAATA 8 34 0 

TCAAGGAGGT CAGAAGAAAA TTGGACTGGT GAAAAGAGAA AAAAGACTGG AGTGTGGGAT 84 0 0 

ATGACCACAG AATAGGATGG CCCTTCTTGC GCTGGACCCC GATAAGATTG TGAAGGGTTT 84 6 0 

ACTGCTCCTT CCATCTGCCT GAGGGGTTGA CTATGACTAC AGAGAATCTO CTGATAGTAA 8 52 0 

AGGGGGCTGG AGGCAAGGAT AAGTTATAGA GCAGTTGGAG GAAGGATGGA AAGATTGGAA 8 58 0 

GCCAGGGCAA ATGGAAAACA GGAGATCCTA ATATGAAAGA AAAATGGATC CCAATCTGAG 8 64 0 

AAAAGGCAAA AGAATGGCTA CTTTTTTGTA TGCTGGAGTA TTTTGTAATA ATGCTGCTTG 870 0 

AGGCTTATGT GAGGTGTTTG GAAAGTATAA CATAGCTGTC ACAGTATAGT CAGAATCGAG 876 0 



• 



AAATGATGCA GGTGCAAATG GTTTATAGCC GTGTGAAGTT CTTAAAGTTT AGAGGCTAAC 882 0 

TTACA^AAAT JAATAAGTTG TTTTGTTTTA lAGCCCGGTA GAGGAGTTAA CCCCAAAGGT 88 8 0 

GATATGGTTT TATTTCCTGT TATGTTTAAC TTAATAATCT TATTTTGGCA TTCTTTTCCC 8 3 4 0 

ATTGAGTATA TACATCTCTA TTTOTOAAAT jTTCATGGAA CTAGCTCTTT TATTTTCCTG 90 00 

CTGGTTTCTT JAGTAATGAG TTAAATAAAA JATTGACACA TACAAAAAAA AAAAAAAAAA 9 06 0 

AAAAAAAAAA AAAAAAAAAA 9 08 0 

(2) INFORMATION FOR SEQ ID NO : 4 5 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2351 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 5 : 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1 5 10 15 

Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
2 0 2 5 3 0 

Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
3 5 4 0 4 5 

Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
50 55 60 

Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn lie 
6 5 70 75 80 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val Val He Thr Leu Lys Asn Met Ala Ser 
100 105 110 

His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 135 140 

Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu 
145 150 155 160 

Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 

165 170 175 

Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu He 
180 185 190 

Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 



Gin Thr Leu His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly 
210 215 220 

Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 
225 230 235 240 

Ala Ala Ser Ala Arg Ala Trp Pro Lys Met His Thr Val Asn Gly Tyr 

245 250 255 

Val Asn Arg Ser Leu Pro Gly Leu lie Gly Cys His Arg Lys Ser Val 
260 265 270 

Tyr Trp His Val He Gly Met Gly Thr Thr Pro Glu Val His Ser He 
275 280 285 

Phe Leu Glu GLy His Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 

290 295 300 

Leu 31u lie Ser Pro He Thr Phe Leu Thr Ala Gin Thr Leu Leu Met 
3 05 310 315 32 0 

Asp Leu Gly Gin Phe Leu Leu Phe Cys His He Ser Ser His Gin His 

325 330 335 

Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
3 4 0 3 4 5 3 5 0 

Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 

Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 380 

Pro Ser Phe lie Gin He Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 400 

Trp Val His Tyr He Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 

405 410 415 

Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 
420 425 430 

Asn Gly Pro Gin Arg He Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
435 440 445 

Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala He Gin His Glu 
450 455 460 

Ser Gly He Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 



Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn He Tyr Pro 

485 490 495 



His 31 y lie Thr Asp Val Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
5D0 505 510 

Gly Val Lys His Leu Lys Asp Phe Pro lie Leu Pro Gly Glu He Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 
5 30 53 5 54 0 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 555 560 

Asp Leu Ala Ser Gly Leu He Gly Pro Leu Leu He Cys Tyr Lys Glu 

565 570 575 

Ser- Val Asp Gin Arg Gly Asn Gin He Met Ser Asp Lys Arg Asn Val 
580 585 590 

He Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 

595 6 00 6 05 

Asn He Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 615 6 2 0 

Pro Glu Phe Gin Ala Ser Asn He Met His Ser He Asn Gly Tyr Val 
625 630 635 640 

Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 

645 650 655 

Tyr He Leu Ser He Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 
660 665 670 

Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp He Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 

725 730 735 

Tyr Tyr Glu Asp Ser Tyr Glu Asp He Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala He Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg His Pro 
755 760 765 



Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Thr lie Pro Glu Asn Asp 
770 775 780 



He Giu Lys Thr Asp Pro Trp Phe Ala His Arg Thr Pro Met Pro Lys 
785 790 795 800 

lie Gin Asn Val Ser Ser Ser Asp Leu Leu Met. Leu Leu Arg Gin Ser 

3 0 5 8 10 815 

Pro Thr Pro His Gly Leu Ser Leu Ser Asp Leu Gin Glu Ala Lys Tyr 
820 825 830 

Glu Thr Phe Ser Asp Asp Pro Ser Pro Gly Ala lie Asp Ser Asn Asn 
835 840 845 

Ser Leu Ser Glu Met Thr His Phe Arg Pro Gin Leu His His Ser Gly 

350 3 55 86 0 

Asp Met Val Phe Thr Pro Glu Ser Gly Leu Gin Leu Arg Leu Asn Glu 
865 870 875 880 

Lys Leu Gly Thr Thr Ala Ala Thr Glu Leu Lys Lys Leu Asp Phe Lys 

885 830 895 

Val Ser Ser Thr Ser Asn Asn Leu lie Ser Thr lie Pro Ser Asp Asn 
9 0 0 9 0 5 910 

Leu Ala Ala Gly Thr Asp Asn Thr Ser Ser Leu Gly Pro Pro Ser Met 
915 920 925 

Pro Val His Tyr Asp Ser Gin Leu Asp Thr Thr Leu Phe Gly Lys Lys 
930 535 940 

Ser Ser Pro Leu Thr Glu Ser Gly Gly Pro Leu Ser Leu Ser Giu Glu 
94 5 950 955 9 60 

Asn Asn Asp Ser Lys Leu Leu Glu Ser Gly Leu Met Asn Ser Gin Glu 

965 970 975 

Ser Ser Trp Gly Lys Asn Val Ser Ser Thr Glu Ser Gly Arg Leu Phe 

980 985 990 

Lys Gly Lys Arg Ala His Gly Pro Ala Leu Leu Thr Lys Asp Asn Ala 
995 1000 1005 

Leu Phe Lys Val Ser lie Ser Leu Leu Lys Thr Asn Lys Thr Ser Asn 
1010 1015 1020 

Asn Ser Ala Thr Asn Arg Lys Thr His lie Asp Gly Pro Ser Leu Leu 
1025 1030 1035 1040 

lie Glu Asn Ser Pro Ser Val Trp Gin Asn lie Leu Glu Ser Asp Thr 

104 5 105 0 105 5 



Glu Phe Lys Lys Val Thr Pro Leu lie His Asp Arg Met Leu Met Asp 
1060 1065 1070 



Lys Asn Ala Thr Ala Leu Arg Leu Asn His Met Ser Asn Lys Thr Thr 
1075 1080 1085 

Ser Ser Lys Asn Met Glu Met Val Gin Gin Lys Lys Glu Gly Pro lie 
1090 1095 1100 

Pro Pro Asp Ala Gin Asn Pro Asp Met Ser Phe Phe Lys Met Leu Phe 
110 5 1110 1115 1120 

Leu Pro Glu Ser Ala Arg Trp lie Gin Arg Thr His Gly Lys Asn Ser 

1125 1130 1135 

Leu Asn Ser Gly Gin Gly Pro Ser Pro Lys Gin Leu Val Ser Leu Gly 
1140 1145 1150 

Pre> Glu Lys Ser Val Glu Gly Gin Asn Phe Leu Ser Glu Lys Asn Lys 
1155 1160 1165 

Val Val Val Gly Lys Gly Glu Phe Thr Lys Asp Val Gly Leu Lys Glu 
1170 1175 1130 

Met Val Phe Pro Ser Ser Arg Asn Leu Phe Leu Thr Asn Leu Asp Asn 
1135 1190 1195 1200 

Leu His Glu Asn Asn Thr His Asn Gin Glu Lys Lys lie Gin Glu Glu 

1205 1210 1215 

He Glu Lys Lys Glu Thr Leu He Gin Glu Asn Val Val Leu Pro Gin 
1220 1225 1230 

He His Thr Val Thr Gly Thr Lys Asn Phe Met Lys Asn Leu Phe Leu 
1235 1240 1245 

Leu Ser Thr Arg Gin Asn Val Glu Gly Ser Tyr Asp Gly Ala Tyr Ala 
1250 1255 1260 

Pro Val Leu Gin Asp Phe Arg Ser Leu Asn Asp Ser Thr Asn Arg Thr 
1265 1270 1275 1280 

Lys Lys His Thr Ala His Phe Ser Lys Lys Gly Glu Glu Glu Asn Leu 

1285 1290 1295 

Glu Gly Leu Gly Asn Gin Thr Lys Gin He Val Glu Lys Tyr Ala Cys 
1300 1305 1310 

Thr Thr Arg He Ser Pro Asn Thr Ser Gin Gin Asn Phe Val Thr Gin 
1315 1320 1325 

Arg Ser Lys Arg Ala Leu Lys Gin Phe Arg Leu Pro Leu Glu Glu Thr 
1330 1335 1340 



Glu Leu Glu Lys Arg He He Val Asp Asp Thr Ser Thr Gin Trp Ser 
1345 1350 1355 1360 



Lys Asn Met Lys His Leu Thr Pro Ser Thr Leu Thr Gin lie Asp Tyr 

1365 1370 1375 

Asn Glu Lys Glu Lys Gly Ala lie Thr Gin Ser Pro Leu Ser Asp Gys 
13 80 1385 13 90 

Leu Thr Arg Ser His Ser lie Pro Gin Ala Asn Arg Ser Pro Leu Pro 
13 95 14 00 14 05 

lie Ala Lys Val Ser Ser Phe Pro Ser He Arg Pro He Tyr Leu Thr 
1410 1415 1420 

Arg Val Leu Phe Gin Asp Asn Ser Ser His Leu Pro Ala Ala Ser Tyr 
1425 1430 1435 1440 

Arg Lys Lys Asp Ser Gly Val Gin Glu Ser Ser His Phe Leu Gin Gly 

1445 1450 1455 

Ala Lys Lys Asn Asn Leu Ser Leu Ala He Leu Thr Leu Glu Met Thr 
1460 1465 1470 

Gly Asp Gin Arg Glu Val Gly Ser Leu Gly Thr Ser Ala Thr Asn Ser 
1475 1480 1485 

Val Thr Tyr Lys Lys Val Glu Asn Thr Val Leu Pro Lys Pro Asp Leu 
1490 1495 1500 

Pro Lys Thr Ser Gly Lys Val Glu Leu Leu Pro Lys Val His He Tyr 
1505 1510 1515 1520 

Gin Lys Asp Leu Phe Pro Thr Glu Thr Ser Asn Gly Ser Pro Gly His 

152 5 1530 15 3 5 

Leu Asp Leu Val Glu Gly Ser Leu Leu Gin Gly Thr Glu Gly Ala He 
1540 1545 1550 

Lys Trp Asn Glu Ala Asn Arg Pro Gly Lys Val Pro Phe Leu Arg Val 
15 55 1560 1565 

Ala Thr Glu Ser Ser Ala Lys Thr Pro Ser Lys Leu Leu Asp Pro Leu 
1570 1575 1580 

Ala Trp Asp Asn His Tyr Gly Thr Gin He Pro Lys Glu Glu Trp Lys 
1585 1590 1595 1600 

Ser Gin Glu Lys Ser Pro Glu Lys Thr Ala Phe Lys Lys Lys Asp Thr 

1605 1610 1615 

lie Leu Ser Leu Asn Ala Cys Glu Ser Asn His Ala He Ala Ala lie 
1620 1625 1630 



Asn Glu Gly Gin Asn Lys Pro Glu lie Glu Val Thr Trp Ala Lys Gin 
16 35 16 40 1645 



Gly Arg Thr Glu Ar g Leu Cys Ser Gin Asn Pro Pro Val Leu Lys Arg 
1650 16 55 16 6 0 

His Gin Arg Glu lie Thr Arg Thr Thr Leu 31 n Ser Asp 31n Glu Glu 
16 6 5 1670 1675 16 30 

lie Asp Tyr Asp Asp Thr lie Ser Val Glu Met Lys Lys Glu Asp Phe 

1635 1690 1695 

Asp lie Tyr Asp Glu Asp Glu Asn Gin Ser Pro Ar<g Ser Phe Gin Lys 
1700 1705 1710 

Lys Thr Arg His Tyr Phe lie Ala Ala Val Glu Arg Leu Trp Asp Tyr 
1715 1720 1725 

Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin Ser Gly 
1730 1735 1740 

Ser Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr Asp Gly 
1745 1750 1755 17 v 0 

Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His Leu Gly 

1765 1770 1775 

Leu Leu Gly Pro Tyr lie Arg Ala Glu Val Glu Asp Asn lie Met Val 
1780 1785 1790 

Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser Ser Leu 
1795 1800 1805 

lie Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg Lys Asn 
1810 1815 1820 

Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val Gin His 
1825 1830 1835 1840 

His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp Ala Tyr 

1845 1850 1855 

Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu lie Gly 
1860 1865 1870 

Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His Gly Arg 
1875 1880 1885 

Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr He Phe Asp Glu 
1890 1895 1900 

Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys Arg Ala 
19 0 5 1910 1915 192 0 



Pro Cys Asn He Gin Met Glu Asp Pro Thr Phe Lys Glu Asn Tyr Arg 

1925 1930 1935 



Phe His Ala He Asn Gly Tyr lie Met Asp Thr Leu Pro Gly Leu Val 
194 0 194 5 195 0 

Met Ala Gin Asp Gin Arg He Arg Trp Tyr Leu Leu Ser Met Gly Ser 
1955 1960 1965 

Asn Glu Asn He His Ser lie His Phe Ser Gly His Val Phe Thr Val 
1970 1375 1980 

Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr Pro Gly 
1985 1990 1995 2000 

Val Phe Glu Thr Val Glu Met Leu Pro Ser Lys Ala Gly He Trp Arg 

2 0 05 2010 2 015 

Val Glu Gys Leu He Gly Glu His Leu IPis Ala Gly Met Ser Thr Leu 

2020 2025 2030 

Phe Leu Val Tyr Ser Asn Lys Gys Gin Thr Pro Leu Gly Met Ala Ser 
2035 2040 2045 

Gly His He Arg Asp Phe Gin He Thr Ala Ser Gly Gin Tyr Gly Gin 
2050 2055 2060 

Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He Asn Ala 
2065 2070 2075 2080 

Trp Ser Thr Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu Leu Ala 

2085 2090 2095 

Pro Met He He His Gly He Lys Thr Gin Gly Ala Arg Gin Lys Phe 
2100 2105 2110 

Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr Ser Leu Asp Gly 
2115 2120 2125 

Lys Lys Trp Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu Met Val 
2130 2135 2140 

Phe Phe Gly Asn Val Asp Ser Ser Gly He Lys His Asn He Phe Asn 
2145 2150 2155 2160 

Pro Pro lie He Ala Arg Tyr lie Arg Leu His Pro Thr His Tyr Ser 

2165 2170 2175 

He Arg Ser Thr Leu Arg Met ("Liu Leu Met Gly Gys Asp Leu Asn Ser 
2180 2185 2190 

Gys Ser Met Pro Leu Gly Met Glu Ser Lys Ala He Ser Asp Ala Gin 
2195 2200 2205 



He Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp Ser Pro 
2210 2215 2220 




Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp Arg Pro 
2225 2230 2235 2240 

Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin Lys Thr 

2245 2250 2255 

Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu Leu Thr 
2260 2265 2270 

Ser Met Tyr Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp Gly His 
2275 228 0 2285 

Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe Gin Gly 
2290 2295 2300 

Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro Pro Leu 

2305 2310 2315 2320 

Leu Thr Arg Tyr Leu Arg lie His Pro Gin Ser Trp Val His Gin lie 

2325 2330 2335 

Ala Leu Arg Met Glu Val Leu Gly Gys Glu Ala Gin Asp Leu Tyr 
2340 2345 2350 



(2) INFORMATION FOR SEQ ID NO: 46: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 

(E) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

{B } TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

CTCGAGCTAA AGATATTTTA GAGAAGAATT AACCTTTTGC TTCTCCAGTT GAACATTTGT 6 0 

AGCAATAAGT CATGCAAATA GAGCTCTCCA CCTGCTTCTT TCTGTGCCTT TTGCGATTCT 12 0 

GCTTTAGTGC CACCAGAAGA TACTACCTGG GTGCAGTGGA ACTGTCATGG GACTATATGC 13 0 

AAAGTGATCT CGGTGAGCTG CCTGTGGACG CAAGATTTCC TCCTAGAGTG CCAAAATCTT 24 0 

TTCCATTCAA CACCTCAGTC GTGTACAAAA AGACTCTGTT TGTAGAATTC ACGGATCACC 3 00 

TTTTCAACAT CGCTAAGCCA AGGCCACCCT GGATGGGTCT GCTAGGTCCT ACCATCCAGG 360 

CTGAGGTTTA TGATACAGTG GTCATTACAC TTAAGAACAT GGCTTCCCAT CCTGTCAGTC 42 0 

TTCATGCTGT TGGTGTATCC TACTGGAAAG CTTCTGAGGG AGCTGAATAT GATGATCAGA 480 

CCAGTCAAAG GGAGAAAGAA GATGATAAAG TCTTCCCTGG TGGAAGCCAT ACATATGTCT 54 0 

GGCAGGTCCT GAAAGAGAAT GGTCCAATGG CCTCTGACCC ACTGTGCCTT ACCTACTCAT 600 

ATCTTTCTCA TGTGGACCTG GTAAAAGACT TGAATTCAGG CCTCATTGGA GCCCTACTAG 66 0 

TATGTAGAGA AGGGAGTCTG GCCAAGGAAA AGACACAGAC CTTGCACAAA TTTATACTAC 72 0 

TTTTTGCTGT ATTTGATGAA GGGAAAAGTT GGCACTCAGA AACAAAGAAC TCCTTGATGC 78 0 

AGGATAGGGA TGCTGCATCT GCTCGGGCCT GGCCTAAAAT GCACACAGTC AATGGTTATG 84 0 

TAAACAGGTO TCTGCCAGGT CTGATTGGAT GCCACAGGAA ATCAGTCTAT TGGCATGTGA 9 00 

TTGGAATGGG CACCACTCCT GAAGTGCACT CAATATTCCT CGAAGGTCAO ACATTTCTTG 96 0 

TGAGGAACOA TOGCCAGGCG TCCTTGGAAA TCTCGCCAAT AACTTTCCTT ACTGCTCAAA 102 0 

CACTCTTGAT GGACCTTGGA CAGTTTCTAC TGTTTTGTCA TATCTCTTCC CACCAACATG 108 0 

ATGGCATGGA AGCTTATGTC AAAGTAGACA GCTGTCCAGA GGAACCCCAA CTACGAATGA 114 0 

AAAATAATGA AGAAGCGGAA GACTATGATG ATGATCTTAC TGATTCTGAA ATGGATGTGG 12 00 

TCAGGTTTGA TGATGACAAC TCTCCTTCCT TTATCCAAAT TCGCTCAGTT GCCAAGAAGC 12 6 0 



atcctaaaa;: tt^ggtagat ta:attgctg gtgaagagga ggactggsac tatggtgggt 1^20 

tajTcctcj^ :c::3ATjAC agaagttata aaagt:aata tttgaa:aat ggg:;:tgagg i-m-.i 

ggattggtag 3aagtagaaa aaagtcc 3a t ttatg 3 2ata gacagatgaa aggtttaaga 144') 

ct:gtgaag: tatt:ag:at ^aat::ag3aa t:ttg3 3A2g tt tactttat ggg ;aa :r: r ;• iv.m 

gaga:agagt gttgattata ttt.aaga.at: aa^gaa^gag agcatataa: atgtagggtg i : -.u 

AGGGAATGAG TGATGTCCGT CCTTTGTATT GAAGGAGATT AGGAAAAGGT GTAAAAGATT 1' ::m 

TGAAGGATTT TGGAATTGTG GGAGGAGAAA TATTGAAATA TAAATGGACA GT3ACTGTAG 1 f/ 1 1 

AAGATGGGGC AACTAAATCA GATCCTCGGT GCCTGACCCG GTATTAGTGT AGTTTCGTTA 174 U 

a tatg jAGa j agat:taggt tgaggagtga ttgg:ggtgt ggt:at:tgg TAGAAAGAAT l-.Oii 

CTGTAGATCA AAGA 3GAAAG GAGATAATGT CAGACAAGAG GAATGTCATC CTGTTTTCT j 1*0 U 

tatttgatga gaagggaagg tggtaggtca gagagaatat agaag jGttt gtggggaat: 1.00 
gagctggagt ggag :ttgag gatggagagt tggaagggtg caacatcatg gagaggatga 

atgggtatgt ttttgatagt ttgcagttgt gagtttgttt gcatgaggtg ggatagtggt 2 04 0 

agattgtaag gatt 3gagga gagagtgagt tggtttgtgt gttgttgtgt ggatataggt 2 1.00 

tgaaagagaa aatggtgtat gaagagagag tcagcctatt cgcattctca ggagaaactg 2 16 0 

t:tt:atgt: gat^gaaaag ggaggtgtat ggattgt 3g j gt^:gagaa: tgagagttp: 2220 

ggaagagagg catgaggggc ttactgaagg tttgtagttg tgacaagaac actggtgatt ::.:ao 

atta:gag:a gagttatgaa gatatttcag catacttggt gagtaaaaag aat 3ggattg 2 34 0 

aacoaagaag cttctcgcag aagggaggag tgttgaaagg ccatcaacgg gaaataa 3tg ::4 00 

gtagtagtgt tcagtcagat gaagaggaaa ttgagtatga tgataggata tcagttgaaa 24 6 0 

tgaagaagga a 3attttgag atttatgat3 aggatgaaaa tgagaggggg gggaggtttg 2 52 0 

aaaagaaaag aggagagtat tttattgctg cagtggagag ggtgtgggat tatgggatga 258 0 

gtagctcgcc ag at 3ttgta agaaagaggg gtgagagtgg gagtgtgggt gagtt :aaga 2 64 0 

aagttgtttt cg aggaattt actgatggct cctttactca gcccttatac ggtggagaag 2 700 

taaatgaaca tttgggagtg gtggggggat atataagagg agaagttgaa gat aat at 2 a 2 76 0 

tggtaacttt cagaaatcag gggtgtggtg cctattcctt ctattctagc gttatttctt 282 0 

atgaggaaga tcagaggcaa ggaggagaag gtagaaaaaa gtttgtgaag ggtaatgaaa 2nft0 

ccaaaactta cttttggaaa gtgcaacatc atatgggagg gagtaaagat gagtttgact 2 04 0 

gcaaagcctg ggcttatttc tctgatgttg acctggaaaa agatgtggac tcaggcctga 3 000 

ttggaggggt tgtggtgtgg cacactaaca cactgaaccc tggtcatggg agacaagtga 3 0 6 0 

gagtagagga atttgctgtg tttttcacca tctttgatga gaggaaaagc tggtagttga 312 0 

gtgaaaatat ggaaagaaac tgcagggctc cctgcaatat ccagatggaa gatcccactt 3 18 0 

ttaaagagaa ttatcgcttc catgcaatca atgggtagat aatggataca gtaggtgggt 324 0 

tagtaatggc tcaggatcaa ag gatt c gat ggtatctggt cagcatgggc agcaatgaaa 3 3 00 

acatccattc tattcatttc agtggacatg tgttcactgt aggaaaaaaa gaggagtata 3 360 

aaatggcact gtagaatgtc tatccaggtg tttttgagac agtggaaatg ttaccatcca 34 2 0 

aaggtggaat ttggggggtg gaatgcctta ttggggagga tctacatgct gggatgagga 34 ft 0 

cactttttct ggtgtacagc aataagtgtc agactcccct gggaatgggt tgtggagaga 354 0 

ttagagattt t gag att ag a gcttoaggag aatatggaca gtgggcccca aagctggcca 3 ^'00 

gacttcatta ttggggatga atcaatgcct ggag gag gaa ggagcccttt tgttggatca 3660 

aggtggatct gttgggagga atgattattg aggggatgaa gacccagggt gggggtcaga 3 72 0 

AGTTGTGGAG CCTCTACATC TCTCAGTTTA TCATCATGTA TAGTCTTGAT GGGAAGAAGT 378 0 

GGGAGAGTTA TCGAGGAAAT TGCACTGGAA CCTTAATGGT CTTCTTTGGC AATGTGGATT 384 0 

CATCTGGGAT AAAAGAGAAT ATTTTTAAGG GTGGAATTAT TGGTCGATAC ATGGGTTTGG 3 ',00 

ACCCAACTCA TTATAGGATT CGCAGCACTC TTCGCATGGA GTTGATGGGC TGTGATTTAA 3^60 

ATAGTTGCAG CATGCCATTG GGAATGGAGA GTAAAGGAAT ATGAGATGGA GAGATTAGTG 4 02 0 

CTTCATCCTA CTTTACCAAT ATGTTTGCCA CCTGGTCTCC TTGAAAAGGT CGACTTCACC 4u8 0 

TCCAAGGGAG GAGTAATGCC TGGAGACCTC AGGTGAATAA TCCAAAAGAG TGGGTGGAAG 414 0 

TGGAGTTGGA GAAGAGAATG AAAGTCACAG GAGTAACTAC TGAGGGAGTA AAATCTCTGG 4 2 00 

TTACCAGCAT GTATGTGAAG GAGTTCCTCA TGTGGAGGAG TCAAGATGGC CATCAGTGGA 426 0 

CTCTCTTTTT TGAGAATGGG AAAGTAAAGG TTTTTGAGGG AAATCAAGAC TCCTTCACAC 4 32 0 

CTGTGGTGAA CTCTCTAGAC GCAGGGTTAG TGACTCGCTA CCTTCGAATT CACCCCCAGA 4380 

GTTGGGTGCA CCAGATTGCC GTGAGGATGG AGGTTCTGGG CTGCGAGGCA CAGGAGCTCT 444 0 

ACTGAGGGTG GGGAGTGGAG GAGGTGGGAG TGCGGTGAGC TCTCCCTGCT CAGCTCCAGG 4 50 0 



GCAGTGTCCC TCCCTGGCTT GCCTT2TACC TTTGTGCTAA ATCCTAGCA j ACACTGCCTT 4 56 0 

GAAGCCTOCT GAATTAAOTA TCATCAGTC 2 TGCATTTOTT TGGTG3GGG3 CCAGGAGGGT 4 62 0 

GCATCCAATT TAACTTAACT CTTACCTATT TTCTGCAGCT GCTCC2AGAT TA2TCCTTCO 4 68 0 

TTCCAATATA ACT AG GC AAA AAGAAGTGAG GAGAAA20TG CATGAAAGCA TTJTTCCCTG 4 74-0 

AAAAGTTAG G CCTCTCAGAG TCACCACTTC CTCTGTTGTA GAAAAACTAT GTGATGAAAC 4 800 

TTTGAAAAAG ATATTTATGA TGTTGCGGCC GG 4 8 32 

(2) INFORMATION FOR SEQ ID NO : 4 7 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 57 amino a -ids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
(O) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 7 : 

Met Gin lie Glu Leu Ser Thr Cys Phe Phe Leu Cys Leu Leu Arg Phe 
1 5 10 15 

Cys Phe Ser Ala Thr Arg Arg Tyr Tyr Leu Gly Ala Val Glu Leu Ser 
2 0 2 5 3 0 

Trp Asp Tyr Met Gin Ser Asp Leu Gly Glu Leu Pro Val Asp Ala Arg 
3 5 4 0 4 5 

Phe Pro Pro Arg Val Pro Lys Ser Phe Pro Phe Asn Thr Ser Val Val 
50 55 60 

Tyr Lys Lys Thr Leu Phe Val Glu Phe Thr Asp His Leu Phe Asn lie 
65 70 75 80 

Ala Lys Pro Arg Pro Pro Trp Met Gly Leu Leu Gly Pro Thr lie Gin 

85 90 95 

Ala Glu Val Tyr Asp Thr Val Val lie Thr Leu Lys Asn Met Ala Ser 
100 105 110 

His Pro Val Ser Leu His Ala Val Gly Val Ser Tyr Trp Lys Ala Ser 
115 120 125 

Glu Gly Ala Glu Tyr Asp Asp Gin Thr Ser Gin Arg Glu Lys Glu Asp 
130 135 140 

Asp Lys Val Phe Pro Gly Gly Ser His Thr Tyr Val Trp Gin Val Leu 
145 150 155 160 

Lys Glu Asn Gly Pro Met Ala Ser Asp Pro Leu Cys Leu Thr Tyr Ser 

165 170 175 

Tyr Leu Ser His Val Asp Leu Val Lys Asp Leu Asn Ser Gly Leu lie 
180 185 190 



Gly Ala Leu Leu Val Cys Arg Glu Gly Ser Leu Ala Lys Glu Lys Thr 
195 200 205 



Gin Thr Leu His Lys Phe lie Leu Leu Phe Ala Val Phe Asp Glu Gly 

210 215 220 

Lys Ser Trp His Ser Glu Thr Lys Asn Ser Leu Met Gin Asp Arg Asp 

225 230 235 240 



Ala Ala Ser Ala Arg Ala Trp Pro 

24 5 

Val Asn Arg Ser Leu Pro Gly Leu 

260 

Tyr Trp His Val lie 
275 

Phe Leu Glu G Ly His 
290 

Leu Glu lie Ser Pro lie Thr Phe 
3 05 310 

Asp Leu Gly Gin Phe Leu Leu Phe 

325 



Lys Met His Thr Val Asn Gly Tyr 

250 255 

lie Gly Cys His Arg Lys Ser Val 
265 270 



Leu Thr Ala Gin Thr Leu Leu Met 
315 32 0 

Cys His lie Ser Ser His Gin His 
330 335 



Gly Met Gly Thr Thr Pro Glu Val His Ser lie 
2 3 0 2 3 5 

Thr Phe Leu Val Arg Asn His Arg Gin Ala Ser 
295 300 



Asp Gly Met Glu Ala Tyr Val Lys Val Asp Ser Cys Pro Glu Glu Pro 
34 0 345 350 

Gin Leu Arg Met Lys Asn Asn Glu Glu Ala Glu Asp Tyr Asp Asp Asp 
355 360 365 

Leu Thr Asp Ser Glu Met Asp Val Val Arg Phe Asp Asp Asp Asn Ser 
370 375 380 

Pro Ser Phe lie Gin lie Arg Ser Val Ala Lys Lys His Pro Lys Thr 
385 390 395 400 

Trp Val His Tyr He Ala Ala Glu Glu Glu Asp Trp Asp Tyr Ala Pro 

405 410 415 

Leu Val Leu Ala Pro Asp Asp Arg Ser Tyr Lys Ser Gin Tyr Leu Asn 
420 425 430 

Asn Gly Pro Gin Arg He Gly Arg Lys Tyr Lys Lys Val Arg Phe Met 
435 440 445 

Ala Tyr Thr Asp Glu Thr Phe Lys Thr Arg Glu Ala He Gin His Glu 
450 455 460 

Ser Gly He Leu Gly Pro Leu Leu Tyr Gly Glu Val Gly Asp Thr Leu 
465 470 475 480 



Leu He He Phe Lys Asn Gin Ala Ser Arg Pro Tyr Asn He Tyr Pro 

485 490 495 



# 



His Gly lie Thr Asp Vai Arg Pro Leu Tyr Ser Arg Arg Leu Pro Lys 
500 505 510 

Gly Val Lys His Leu Lys Asp Phe Pro He Leu Pro Gly Glu He Phe 
515 520 525 

Lys Tyr Lys Trp Thr Val Thr Val Glu Asp Gly Pro Thr Lys Ser Asp 

530 535 54 0 

Pro Arg Cys Leu Thr Arg Tyr Tyr Ser Ser Phe Val Asn Met Glu Arg 
545 550 555 560 

Asp Leu Ala Ser Gly Leu He Gly Pro Leu Leu He Cys Tyr Lys Glu 

565 570 575 

Ser Val Asp Gin Arg Gly Asn Gin He Met Ser Asp Lys Arg Asn Val 

580 53 5 5 90 

He Leu Phe Ser Val Phe Asp Glu Asn Arg Ser Trp Tyr Leu Thr Glu 
595 600 605 

Asn He Gin Arg Phe Leu Pro Asn Pro Ala Gly Val Gin Leu Glu Asp 
610 615 620 

Pro Glu Phe Gin Ala Ser Asn He Met His Ser lie Asn Gly Tyr Val 
625 630 635 64 0 

Phe Asp Ser Leu Gin Leu Ser Val Cys Leu His Glu Val Ala Tyr Trp 

645 650 655 

Tyr He Leu Ser He Gly Ala Gin Thr Asp Phe Leu Ser Val Phe Phe 

660 665 670 

Ser Gly Tyr Thr Phe Lys His Lys Met Val Tyr Glu Asp Thr Leu Thr 
675 680 685 

Leu Phe Pro Phe Ser Gly Glu Thr Val Phe Met Ser Met Glu Asn Pro 
690 695 700 

Gly Leu Trp He Leu Gly Cys His Asn Ser Asp Phe Arg Asn Arg Gly 
705 710 715 720 

Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr Gly Asp 

725 730 735 

Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu Ser Lys 
740 745 750 

Asn Asn Ala He Glu Pro Arg Ser Phe Ser Gin Asn Pro Pro Val Leu 
755 76 0 765 



Lys Arg His Gin Arg Glu He Thr Arg Thr Thr Leu Gin Ser Asp Gin 
770 775 780 



• 



Glu Hu He Asp Tyr Asp Asp Thr He Ser Val Glu Met Lys Lys Glu 

785 790 795 800 

Asp Phe Asp He Tyr Asp Glu Asp Glu Asn Gin Ser Pro Arg Ser Phe 

8 05 810 815 

Gin Lys Lys Thr Arg His Tyr Phe He Ala Ala Val Glu Arg Leu Trp 

8 2 0 8 2 5 8 3 0 

Asp Tyr Gly Met Ser Ser Ser Pro His Val Leu Arg Asn Arg Ala Gin 
335 840 845 

Ser Gly Ser Val Pro Gin Phe Lys Lys Val Val Phe Gin Glu Phe Thr 

350 355 860 

Asp Gly Ser Phe Thr Gin Pro Leu Tyr Arg Gly Glu Leu Asn Glu His 
365 870 375 880 

Leu Gly Leu Leu Gly Pro Tyr He Arg Ala Glu Val Glu Asp Asn He 

885 890 395 

Met Val Thr Phe Arg Asn Gin Ala Ser Arg Pro Tyr Ser Phe Tyr Ser 
900 905 910 

Ser Leu He Ser Tyr Glu Glu Asp Gin Arg Gin Gly Ala Glu Pro Arg 
915 920 925 

Lys Asn Phe Val Lys Pro Asn Glu Thr Lys Thr Tyr Phe Trp Lys Val 
930 935 940 

Gin His His Met Ala Pro Thr Lys Asp Glu Phe Asp Cys Lys Ala Trp 
945 950 955 960 

Ala Tyr Phe Ser Asp Val Asp Leu Glu Lys Asp Val His Ser Gly Leu 

965 970 975 

lie Gly Pro Leu Leu Val Cys His Thr Asn Thr Leu Asn Pro Ala His 

980 985 990 

Gly Arg Gin Val Thr Val Gin Glu Phe Ala Leu Phe Phe Thr He Phe 
995 1000 1005 

Asp Glu Thr Lys Ser Trp Tyr Phe Thr Glu Asn Met Glu Arg Asn Cys 
1010 1015 1020 

Arg Ala Pro Cys Asn lie Gin Met Glu Asp Pro Thr Phe Lys Glu Asn 
1025 1030 1035 1040 

Tyr Arg Phe His Ala lie Asn Gly Tyr lie Met Asp Thr Leu Pro Gly 

1045 1050 1055 



Leu Val Met Ala Gin Asp Gin Arg lie Arg Trp Tyr Leu Leu Ser Met 
1060 1065 1070 



Gly Ser Asn GIu Asn lie His Ser lie His Phe Ser Gly His Val Phe 
1075 1080 1035 

Thr Val Arg Lys Lys Glu Glu Tyr Lys Met Ala Leu Tyr Asn Leu Tyr 
10 9 0 10 9 5 1 L 0 0 

Pro Gly Val Phe Glu Thr Val Glu Me- Leu Pro Ser Lys Ala Gly He 
1105 1110 1115 112 0 

Trp Arg Val Glu Gys Leu He Gly Glu His Leu His Ala Gly Met Ser 

1125 1130 1135 

Thr Leu Phe Leu Val Tyr Ser Asn Lys Cys Gin Thr Pro Leu Gly Met. 
1140 1145 1150 

Ala Ser Gly His He Arg Asp Phe Gin He Thr Ala Ser Gly Gin Tyr 
1155 1160 116 5 

Gly Gin Trp Ala Pro Lys Leu Ala Arg Leu His Tyr Ser Gly Ser He 
1170 1175 1180 

Asn Ala Trp Ser Thr Lys Glu Pro Phe Ser Trp He Lys Val Asp Leu 
1185 1190 1195 1200 

Leu Ala Pro Met He He His Gly lie Lys Thr Gin Gly Ala Arg Gin 

1205 1210 1215 

Lys Phe Ser Ser Leu Tyr He Ser Gin Phe He He Met Tyr Ser Leu 
1220 1225 1230 

Asp Gly Lys Lys Trp Gin Thr Tyr Arg Gly Asn Ser Thr Gly Thr Leu 
1235 1240 1245 

Met Val Phe Phe Gly Asn Val Asp Ser Ser Gly He Lys His Asn He 
1250 1255 1260 

Phe Asn Pro Pro He He Ala Arg Tyr He Arg Leu His Pro Thr His 
1265 1270 1275 1280 

Tyr Ser He Arg Ser Thr Leu Arg Met Glu Leu Met Gly Gys Asp Leu 

1285 1290 1295 

Asn Ser Gys Ser Met Pro Leu Gly Met Glu Ser Lys Ala lie Ser Asp 
1300 1305 1310 

Ala Gin lie Thr Ala Ser Ser Tyr Phe Thr Asn Met Phe Ala Thr Trp 
1315 1320 1325 

Ser Pro Ser Lys Ala Arg Leu His Leu Gin Gly Arg Ser Asn Ala Trp 
13 30 1335 1340 



Arg Pro Gin Val Asn Asn Pro Lys Glu Trp Leu Gin Val Asp Phe Gin 
1345 1350 1355 1360 



Lys Thr Met Lys Val Thr Gly Val Thr Thr Gin Gly Val Lys Ser Leu 

13 6 5 1370 1375 

Leu Thr Ser Met: Tyr Val Lys Glu Phe Leu lie Ser Ser Ser Gin Asp 
13 8 0 13 8 5 13 9 0 

Gly His Gin Trp Thr Leu Phe Phe Gin Asn Gly Lys Val Lys Val Phe 
1395 1400 1405 

Gin Gly Asn Gin Asp Ser Phe Thr Pro Val Val Asn Ser Leu Asp Pro 
1410 1415 1420 

Pro Leu Leu Thr Arg Tyr Leu Arg lie His Pro Gin Ser Trp Val His 
1425 1430 1435 1440 

Gin lie Ala Leu Arg Met Glu Val Leu Gly Gys Glu Ala Gin Asp Leu 

1445 1450 1455 

Tyr 



(2) INFORMATION FOR SEQ ID NO: 48: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 amino acids 
(3) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

Arg Gly Met Thr Ala Leu Leu Lys Val Ser Ser Cys Asp Lys Asn Thr 
15 10 15 

Gly Asp Tyr Tyr Glu Asp Ser Tyr Glu Asp lie Ser Ala Tyr Leu Leu 
20 25 30 

Ser Lys Asn Asn Ala lie Glu Pro Arg Ser Phe Ser Gin Asn Ser Arg 
35 40 45 

His Pro Ser Thr Arg Gin Lys Gin Phe Asn Ala Thr Pro Pro Thr Pro 
50 55 h0 

Pro Thr Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thr Arg Thr 
65 70 75 80 

Thr Leu Gin Ser Asp Gin Glu Glu lie Asp Tyr Asp Asp Thr lie Ser 

85 90 95 

Val Glu Met Lys 
100 



(2) INFORMATION FOR SEQ ID NO : 4 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 'CO base pairs 
(?. ) TYPE: nu - lei-:: acii 

(C) STRANDEDNESS : single 
( D ) TOPOLOGY : 1 inear 

(ii) MOE3:ULE TYPE : DNA (■genomic) 

(xi) sequence des 2r i pt i ont : seq id no : 4 3 : 

agaggcatga ccgccttact gaaggtttct agttgtgaca agaacactgg tgattattac 6 0 

gaggacagtt atgaagatat ttcagcatac ttgctgagta aaaacaatgc cattgaacca 12 0 

agaagcttot ccoagaattc tagacaccct agcactaggc aaaagcaatt taatgccac: 1.-30 

cotoctacac caocaacccc accagtactg aaacgccatc aacgggaaat aactcgtact 24 0 

act:ttcagt ctgatcaa:;a ggaaattgac tatgatgata ccatatcagt tgaaatgaag 300 

(2) information for seq id no: 50: 

(l) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 7 amino acids 

( B ) TYPE: am i no a c i d 

(O STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Ser Arg His Pro Ser Thr Arg Gin Lys GLn Phe Asn Ala Thr Pro Pro 
1 5 10 15 

Val Leu Lys Arg His Gin Arg Glu lie Thr Arg 
2 0 2 5 



(2) INFORMATION FOR SEQ ID NO : 5 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: Si base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 1 : 
TCGCGACACC CTAGCACTAG GCAAAAGCAA TTTAATGCCA CCCCACCAGT CCTGAAACGC 6 0 

CATCAACGGG AAATAACGCG T 81 



(2) INFORMATION FOR SEQ ID NO: 52: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
ACTACTCTTC AATCTGATCA AGAGGAA 2 7 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 



(B) TYPE: nucleic acid 
(C.) STRANrEDMESS : single 
(D) TjPOLOGY: linear 
(ii> MOLETULE TYPE: DMA (genoTiic) 

(xi) sequence description: seq id no: 53: 
cgcjC:gctc gajTCTacaa t^gctttgcc ttttggttta ctg 

(2) information for seq id no: 54: 
{ i > sequence characteristics : 

(A) LENGTH: -13 base pairs 

( B) TYPE : nuclei c acid 

(C) STRAND SDN ESS : single 
CI-) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DMA (genomic) 
(xi) SEQUENCE DESCRIPTI I'N : SE;i ID N:):54: 
GOGCCCATCG ATTTATTOCT TGCTGGTTAA CCTTTCTTGC AAG 

(2) INFORMATION FOR SEQ ID NO : 5 5 : 
(L) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4 8 base pairs 
( B- ) TYPE : nuclei c a cid 
(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 
(ill MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
CGCGCCGCTC GAG CATC CCA ATGGCCCTGT CCTTTTCTTT ACTGATGG 

C:) INFORMATION FOR SEQ ID NO:5b: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
CCATCGATTC AATCCTTCCT CCTTAATCTT TTTTGCAAG 



(2) INFORMATION FOR SEQ ID NO: 57: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
CCGGATCCTC TACAATGGCT TTGCCTTTTG CTTTACTG 

(2) INFORMATION FOR SEQ ID NO: 58: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DMA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

r^cocccGCG g:cgcttatt ccttcctcct taaccttt:t tgcaag 



46 



(2) INFORMATION FOR SEQ ID NO: 59: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH : 4 3 base pairs 

(B) TYPE: nucleic arid 

(C) STRANDEDNESS : single 

(D) TOPOL0 3Y: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) sequence description: seq id no: 59: 
c::ggat:cca tcccaatggc cctgtccttt tctttactga tgg 43 

(2) information for seq id no: 60: 
(i) sequence characteristics: 

(A) LENGTH: 46 base pairs 

(EG TYPE: nucleic acid 

(CO STRANDEDNESS: single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 0 : 
CGCGCCGGCG GCCGCTCAAT CCTTCCTCCT TAATCTTTTT TGCAAG 4 6 

(2) INFORMATION FOR SEQ ID NO : 6 1 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 90 base pairs 

( B ) TYPE: nucleic acid 
(O STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 1 : 
AGCTTGCTGT TTGTGTGCTG CCTCTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 6 0 

TCCCTAAAAT GGGCAAACAT TGCAAGCAGC 9 0 

(CO INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 7 0 base pairs 
(PC TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi.) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 

AAACAGCAAA CACACAGCCC TCCCTGCCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 6 0 

ACCTCTCTGA 7 0 

(2) INFORMATION FOR SEQ ID NO:63: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 82 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 3 : 




AGCTTCA CAG AGGTCTCTGA C 2TCT 'JC CCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 6 0 

TGTGTTTjCT GTTTGCTGGT 7 j 8 2 

(2) INFORMATION FOR SEQ ID NO: 64: 
■i) sequsn:e CHARACTERISTICS: 

(A) LENGTH: 7 8 base pairs 

(B) TYPE: nucleic acid 

(C) 8TRAND2DNESS : single 

( D ) TOPOLOGY: linear 

(ii> MOLECULE TYPE: DNA (genomic) 

(xi) S SQUENCE DESCRIPTION: SEQ ID NO: 64: 

CAATGTTTGC CCATTTTA 2tG GACATGAGTA GG 2TGAAGTT TGTTCAGTGT GGACTTCAGA 6 0 

GGCAGCACAO AAAOAGCA 7 8 

(2) INFORMATION F2R GEO ID N0:»-5: 
(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: ;<0 base pairs 
{ B ) TYPE : nucleic aci i 
(0) STRAND EDN ESS : single 
{ D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA {genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 5 : 
AATTCGCTGT TTGTGTGCTG OOTCTGAAGT CCACACTGAA CAAACTTCAG CCTACTCATG 6 0 

TCCCTAAAAT GGGCAAACAT TGCAAGCAGC 9 0 

(2} INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDSDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
AAACAGCAAA CACACAGCCC TCCCTGCCTG CTGACCTTGG AGCTGGGGCA GAGGTCAGAG 6 0 

ACCTCTCTGG 7 0 

(2.1 INFORMATION FOR SEQ ID NO: 67: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 82 base pairs 
(3) TYPE: nucleic acid 
(0) STRANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 7 : 
AATTCCAGAG AGGTCTCTGA CCTCTGCCCC AGCTCCAAGG TCAGCAGGCA GGGAGGGCTG 6 0 

TGTGTTTGCT GTTTGCTGCT TG 8 2 



(2) INFORMATION FOR SEQ ID NO : 6 8 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPC'LOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 

kxi) SEQUENCE DESCRIPTION: SEj ID NO:68: 

caatjTttgc ccattttagg ga::atgagta jSZtgaastt tgttcagtgt ggacttcaga eo 

GGOATJACAO AAACAGCG 7 8 



(2) INFORMATION FOR GEO ID MO : 6 d : 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 3 base pairs 
( b ) TYPE: nucleic 5i c i. d 
(0) STRAND EDN ESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : DNA (genomic) 

(xi) sequence description': seq id no: 69: 
:gcgccgcco ggggtagatc ttgctaccag tgg 33 



C!) INFORMATION FOR SEQ ID NO: 70: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
(6) TYPE: nuclei- a -id 
(C> STRANDEDNESS : single 
(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 
GCGCCCGCGG CCGOCACTGT CCCAGGTCAG TGGTGGTGCC 4 0 



(2) INFORMATION FOR SEQ ID NO : 7 I : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

( B ) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 4 6 



{2) INFORMATION FOR SEQ ID NO: 72: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 43 base pairs 
(5) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 
GCGCCCATCG ATTCATTCCT TACTTCTTAA ACTTTCTTGC AAG 4 3 



(2) INFORMATION FOR SEQ ID NO : 7 3 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 
(E) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CGCGCCGGCG GCCGCTCTAC AATGGCCTTG ACCTTTGCTT TACTGG 4 6 




C>) INFORMATION FOR 3EJ ID MO: 74: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENOTH: 4 3 bai?e puirs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS: single 
i P) TjPOLjGY: linear 

(ii) MOLECULE TYPE: DKA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 
GCGCCCATCG ATTCATTOOT TACTT2TTAA ACTTTCTTGC AAG 4 3 

CI) INFORMATION FOR SEQ ID NO: 75: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH : 6 amino acids 
IB) TYPE): amine acid 
(G) STRANDEDNESS : single 

( D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: 
Asn Ser Arg His Pro Ser 
1 5 

(2) INFORMATION FOR SEQ ID NO: 76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE : amino a:' id 

(0) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

Gin Asn Pro Pro Val Leu Lys Arg His Gin Arg Glu lie Thr 
15 10 

(2) INFORMATION FOR SEQ ID NO: 77: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77 : 
AATTCGCGAC ACCCTAGC 18 

(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 42 base pairs 
(P) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO': 78: 
CAAAACCCAC CAGTCTTGAA ACGCCATCAA CGGGAAATAA CG 4 2 



(2) INFORMATION FOR SEQ ID NO : 7 9 : 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 base pairs 

(B) TYPE: nu-Leic 

(2) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: ON A (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NJ:79: 
GOGCTGTGGG ATCGGTTTTG GGTGGTCAGA AC 3 2 

(2) INFORMATION E jR SEQ ID NO: 30: 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 4 base pairs 
iB) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
ID) TjPuLOGY: linear 

(iii MOLEI'ULE TYPE: ON A (genomic) 
(xi) SEQUENCE DESCRIPTION: SEC ID NO : 8 0 : 
TTTGCGGTAG TTGCCCTTTA TTGC 2 4 

(2) INFORMATION FOR SEQ ID NO : 8 I : 
(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: b amino acids 
{&) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: 3EQ ID NO : 8 1 : 
Arg Thr Leu Gin Ser Asp 
1 5 

(2) INFORMATION FOR SEQ ID NO:82: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 
CGTACTCTTC AGTCT 15 

(2) INFORMATION FOR SEQ ID NO : 8 3 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 3 : 
GCATGAGAAG TC AG AC TAG 19 

(2) INFORMATION FOR SEQ ID NO: 84: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 



, : (C) STRAND EDNESS : single 

\D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
< ' TGCATGCCTG CAGGTC 

■ ( 



